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HCV VARIANTS 

Background of the Invention 

Reference to Government Grant 
5 This invention was made with government support under Public Health Service 

Grants CA 57973 and AI 40034. The government has certain rights in this invention. 

Background of the Invention 

10 (1) Field of the Invention 

The invention relates to materials and methodologies relating to the production and 
use of hepatitis C virus (HCV) variants. More specifically, HCV variants are provided that 
are useful for diagnostic, therapeutic, vaccines and other uses. 

1 5 (2) Description of the Related Art 

Brief general overview of hepatitis C virus 
After the development of diagnostic tests for hepatitis A virus and hepatitis B virus, an 
additional agent, which could be experimentally transmitted to chimpanzees [Alter et al., 
Lancet 1, 459-463 (1978); Hollinger et al., Intervirology 10, 60-68 (1978); Tabor et al., 

20 Lancet 1, 463-466 (1978)], became recognized as the major cause of transfusion-acquired 

hepatitis. cDNA clones corresponding to the causative non-A non-B (NANB) hepatitis agent, 
called hepatitis C virus (HCV), were reported in 1989 [Choo et al., Science 244, 359-362 
(1989)]. This breakthrough has led to rapid advances in diagnostics, and in our understanding 
of the epidemiology, pathogenesis and molecular virology of HCV (For review, see Houghton 

25 et al, Curr Stud Hematol Blood Transfus 61, 1-1 1 (1994); Houghton (1996), pp. 1035-1058 
in FIELDS VIROLOGY, Fields et al., Eds., Raven Press, Philadelphia; Major et al., 
Hepatology 25, 1527-1538 (1997); Reed and Rice, pp. 1-37 in HEPATITIS C VIRUS, 
Reesink, Ed., Karger, Basel; Hagedorn and Rice (1999), THE HEPATITIS C VIRUSES, 
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Springer, Berlin). Evidence of HCV infection is found throughout the world, and the 
prevalence of HCV-specific antibodies ranges from 0.4-2% in most countries to more than 
14% in Egypt [Hibbs et al, J. Inf. Dis. 168, 789-790 (1993)]. Besides transmission via blood 
or blood products, or less frequently by sexual and congenital routes, sporadic cases, not 
5 associated with known risk factors, occur and account for more than 40% of HCV cases 
[Alter et al, J. Am. Med. Assoc. 264, 2231-2235 (1990); Mast and Alter, Semin. Virol. 4, 
273-283(1993)]. Infections are usually chronic [Alter et al, N. Eng. J. Med. 327, 
1899-1905 (1992)], and clinical outcomes range from an inapparent carrier state to acute 
hepatitis, chronic active hepatitis, and cirrhosis which is strongly associated with the 

10 development of hepatocellular carcinoma. 

Although interferon (BFN)-a has been shown to be useful for the treatment of a 
minority of patients with chronic HCV infections [Davis et al, N.Engl.J.Med. 321, 
1501-1506 (1989); DiBisceglie et al, New Engl. J. Med. 321, 1506-1510 (1989)] and 
subunit vaccines show some promise in the chimpanzee model [Choo et al, Proc. Natl Acad. 

15 Set USA 91, 1294-1298 (1994)], future efforts are needed to develop more effective 

therapies and vaccines (See, e.g., Tsambiras et al., 1999, Hepatitis C: Hope on the Horizon, 
Hepatitis C Symposium of 37 th Annual Meeting of the Infectious Diseases Society of 
America, reviewed at 

http://ww.medscape.con^medscape/cno/1999/IDSA/Story.cfm?stoiy_id=913). The 
20 considerable diversity observed among different HCV isolates [for review, see Bukh et al, 
Sem. Liver Dis. 15, 41-63 (1995); Fanning et al., 2000, Medscape Gastroenterology 
2:mgi6558.fann], the emergence of genetic variants in chronically infected individuals 
[Enomoto et al, J. Hepatol. 17, 415-416 (1993); Hijikata et al, Biochem. Eiophys. Res. 
Comm. 175, 220-228 (1991); Kato etal, Biochem. Biophys. Res. Comm. 189, 119-127 
25 (1992); Kato et al, J. Virol. 67, 3923-3930 (1993); Kurosaki et al, Hepatology 18, 

1293-1299 (1993); Lesniewski et al, J. Med. Tirol. 40, 150-156 (1993); Ogata et al, Proc. 
Natl. Acad. Set USA 88, 3392-3396 (1991); Weiner et al, Virology 180, 842-848 (1991); 
Weiner et al, Proc. Natl Acad. Sci. USA 89, 3468-3472 (1992)], and the lack of protective 
immunity elicited after HCV infection [Farci et al, Science 258, 135-140 (1992); Prince et 
30 al, J. Infect. Dis. 165, 438-443 (1992)] present major challenges towards these goals. 



Molecular Biology of HCV 
Classification. Based on its genome structure and virion properties, HCV has been 
classified as a separate genus in the flavivirus family, which includes two other genera: the 
35 flaviviruses (e.g., yellow fever (YF) virus) and the animal pestiviruses (e.g., bovine viral 
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diarrhea virus (BVDV) and classical swine fever virus (CSFV)) [Francki et al, Arch. Virol. 
Suppl. 2, 223 (1991)]. All members of this family have enveloped virions that contain a 
positive-strand RNA genome encoding all known virus-specific proteins via translation of a 
single long open reading frame (ORF). 
5 Structure and physical properties of the virion. Studies on the structure and physical 

properties of the HCV virion have been hampered by the lack of a cell culture system able to 
support efficient virus replication and the typically low titers of infectious virus present in 
serum. The size of infectious virus, based on filtration experiments, is between 30-80 nm 
[Bradley et al, Gastroenterology 88, 773-779 (1985); He et al, J. Infect. Dis. 156, 636-640 

10 (1987); Yuasa et al, J. Gen. Virol. 72, 2021-2024 (1991)]. Initial measurements of the 

buoyant density of infectious material in sucrose yielded a range of values, with the majority 
present in a low density pool of < 1.1 g/ml [Bradley et al., J. Med. Virol. 34, 206-208 
(1991)]. Subsequent studies have used RT/PCR to detect HCV-specific RNA as an indirect 
measure of potentially infectious virus present in sera from chronically infected humans or 

15 experimentally infected chimpanzees. From these studies, it has become increasingly clear 
that considerable heterogeneity exists between different clinical samples, and that many 
factors can affect the behavior of particles containing HCV RNA [Hijikata et al., J. Virol. 67, 
1953-1958 (1993); Thomssen et al., Med. Microbiol. Immunol. 181, 293-300 (1992)]. Such 
factors include association with immunoglobulins [Hijikata et al, (1993) supra] or low 

20 density lipoprotein [Thomssen et al, 1992, supra; Thomssen et al, Med. Microbiol 

Immunol 182,329-334(1993)]. In highly infectious acute phase chimpanzee serum, HCV- 
specific RNA is usually detected in fractions of low buoyant density (1.03-1.1 g/ml) [Carrick 
et al, J. Virol Meth. 39, 279-289 (1992); Hijikata et al, (1993) supra]. In other samples, the 
presence of HCV antibodies and formation of immune complexes correlate with particles of 

25 higher density and lower irifectivity [Hijikata et al, (1993) supra]. Treatment of particles 
with chloroform, which destroys infectivity [Bradley et al, J. Infect. Dis. 148, 254-265 
(1983); Feinstone et al, Infect. Immun. 41, 816-821 (1983)], or with nonionic detergents, 
produced RNA containing particles of higher density (1.17-1.25 g/ml) believed to represent 
HCV nucleocapsids [Hijikata et al, (1993) supra; Kanto et al, Hepatology 19, 296-302 

30 (1994); Miyamoto et al, J. Gen Tirol 73,715-718 (1992)]. 

There have been reports of negative-sense HCV-specific RNAs in sera and plasma 
[see Fong et al, Journal of Clinical Investigation 88:1058-60(1991)]. However, it seems 
unlikely that such RNAs are essential components of infectious particles since some sera with 
high infectivity can have low or undetectable levels of negative-strand RNA [Shimizu et al, 

35 Proc. Natl. Acad. Sci. USA 90: 6037-6041 (1993)]. 
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The virion protein composition has not been rigorously determined, but HCV 
structural proteins include a basic C protein and two membrane glycoproteins, El and E2. 

HCV replication. Early events in HCV replication are poorly understood. A 
hepatocyte receptor may be CD81, which binds the E2 envelope glycoprotein (Peleri et al., 
5 1998, Science 252:938-41). The association of some HCV particles with beta-lipoprotein and 
immunoglobulins raises the possibility that these host molecules may modulate virus uptake 
and tissue tropism. 

Studies examining HCV replication have been largely restricted to human patients or 
experimentally inoculated chimpanzees. In the chimpanzee model, HCV RNA is detected in 

10 the serum as early as three days post-inoculation and persists through the peak of serum 
alanine aminotransferase (ALT) levels (an indicator of liver damage) [Shimizu et al, Proc. 
Natl. Acad. Set USA 87: 6441-6444(1990)]. The onset ofviremia is followed by the 
appearance of indirect hallmarks of HCV infection of the liver. These include the appearance 
of a cytoplasmic antigen [Shimizu et al, (1990) supra] and ultrastructural changes in 

1 5 hepatocytes such as the formation of microtubular aggregates for which HCV previously was 
referred to as the chloroform-sensitive "tubule forming agent" or "TFA" [reviewed by 
Bradley, Prog. Med. Virol. 37: 101-135(1990)]. As shown by the appearance of viral 
antigens [Blight et al, Amer. J. Path. 143: 1568-1573 (1993); Hiramatsu et al, Hepatology 
16: 306-3 1 1 (1992); KrawczynsM et al, Gastroenterology 103: 622-629 (1992); Yamada et 

20 al, Digest. Dis. Sci. 38: 882-887 (1993)] and the detection of positive and negative sense 
RNAs [Fong et al, (1991) supra; Gunji et al., Arch. Virol 134: 293-302 (1994); Haruna et 
al, J. Hepatol. 18: 96-100 (1993); Lamas et al, J. Hepatol. 16: 219-223 (1992); Nouri Aria 
etal, J. Clin.Inves. 91: 2226-34(1993); Sherkeref al, J. Med. Virol. 39:91-96(1993); 
Takehara et al, Hepatology 15: 387-390 (1992); Tanaka et al, Liver 13: 203-208 (1993)], 

25 hepatocytes appear to be a major site of HCV replication, particularly during acute infection 
[Negro et al, Proc. Natl. Acad Sci. USA 89: 2247-2251 (1992)]. Li later stages of HCV 
infection the appearance of HCV-specific antibodies, the persistence or resolution ofviremia, 
and the severity of liver disease, vary greatly both in the chimpanzee model and in human 
patients (Fanning et al., supra). Although some liver damage may occur as a direct 

30 consequence of HCV infection and cytopathogenicity, the emerging consensus is that host 
immune responses, in particular virus-specific cytotoxic T lymphocytes, may play a more 
dominant role in mediating cellular damage. 

It has been speculated that HCV may also replicate in extra-hepatic reservoir(s). In 
some cases, RT/PCR or in situ hybridization has shown an association of HCV RNA with 

35 peripheral blood mononuclear cells including T-cells, B-cells, and monocytes [reviewed in 
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Blight and Gowans, Viral Hepatitis Rev. 1: 143-155 (1995)]. Such tissue tropism could be 
relevant to the establishment of chronic infections and might also play a role in the 
association between HCV infection and certain immunological abnormalities such as mixed 
cryoglobulinemia [reviewed by Ferri et al., Eur. J. Clin. Invest. 23: 399-405 (1993)], 
5 glomerulonephritis, and rare non-Hodgkin's B-lymphomas [Ferri et al, (1993) supra; Kagawa 
etal, Lancet 341: 316-317 (1993)]. However, the detection of circulating negative strand 
RNA in serum, the difficulty in obtaining truly strand-specific RT/PCR [Gunji et al, (1994) 
supra], and the low numbers of apparently infected cells have made it difficult to obtain 
unambiguous evidence for replication in these tissues in vivo. 

10 Genome structure. Full-length or nearly full-length genome sequences of numerous 

HCV isolates have been reported [see, e.g., Lin et al., J. Virol. 68: 5063-5073 (1994a); 
Okamoto etal, J. Gen. Virol. 75: 629-635 (1994); Sakamoto etal, J. Gen. Virol. 75: 
1761-1768 (1994); Trowbridge et al, Arch Virol 743:501-511 (1998); Chamberlain etal, J. 
Gen. Virol. 75:1341-1347 (1997); and citations within Davis, Am. J. Med. 27:21S-26S]. HCV 

15 genome RNAs are ~9.6 kilobases (kb) in length (Figure 1) and consist of a 5' nontranslated 
region (5 ' NTR), a polyprotein coding region consisting of a single long open reading frame 
(ORF), and a 3' NTR. The 5' NTR is 341-344 bases long and highly conserved. The length 
of the long ORF varies slightly among isolates, encoding polyproteins of about 3010 to about 
3033 amino acids. 

20 The 3' NTR can be divided into three domains. The first (most 5') domain shows 

considerable diversity both in composition and length (28-42 bases). Recent work by Yanagi 
et al. [Proc. Natl. Acad. Sci. USA 96:2291-2295(1999)] demonstrate that this region is not 
necessary for virus replication. The second domain is consists of a variable length 
polypyrimidine region of poly(A) (in at least HCV-1, type la [Han et al, Proc. Natl Acad. 

25 Sci. USA 88:1711-1715 (1991)]) or poly(U-UC) (see Chen et al, Virology 188:102-113 
(1992); Okamoto et al, J. Gen. Virol 72:2697-2704 (1991); Tokita et al, J. Gen. Virol. 
66:1476-83 (1994)]. The third domain, at the extreme 3' end of the genome, is a highly 
conserved, novel RNA element of about 98 nucleotides, which is necessary for efficient 
initiation of viral RNA replication [see, e.g., U.S. Patent No. 5,874,565 and U.S. Patent 

30 Application No. 08/81 1,566 (Now U.S. Patent No. ); Kolykhalov et al, J. Virol. 70: 

3363-3371 (1996); Tanaka et al, Biochem. Biophys. Res. Comm. 215: 744-749 (1996); 
Tanakaefa/., /. Virol 70:3307-12 (1996); Yamada et al, Virology 223:255-261 (1996); 
Cheng et al. J. Virol. 73:7044-7049]. This domain and the polypyrimidine regions appear to 
be critical for infectivity in vivo [Yanagi et al., Proc. Natl Acad. Sci. USA P6~:2291-2295 

35 (1999)]. 
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Translation and proteolytic processing. The highly conserved 5' NTR sequence 
contains multiple short AUG-initiated ORFs and shows significant homology with the 5' NTR 
region of pestiviruses [Bukh et al, Proc. Natl Acad. Sci. USA 89: 4942-4946 (1992); Han et 
al, (1991) supra]. A series of stem-loop structures that interact with host factors are present. 
5 These structures interact with host factors to initiate polyprotein synthesis through an internal 
ribosome entry site (IRES) allowing efficient translation initiation at the first AUG of the long 
ORF [Honda et al., J. Virol 75:4941-4951 (1999); Tang et al., J. Virol. 75:2359-2364(1999); 
Psaridi et al., FEBSLett. 453:49-53 (1999)]. Some of the predicted features of the HCV and 
pestivirus IRES elements are similar to one another [Brown et al., (1992) supra]. The ability 
10 of this element to function as an IRES suggests that HCV genome RNAs may lack a 5' cap 
structure. 

The organization and processing of the HCV polyprotein (Figure 1) appears to be 
most similar to that of the pestiviruses. At least 10 polypeptides have been identified and the 
order of these cleavage products in the polyprotein is NH2-C-El-E2-p7-NS2-NS3-NS4A- 

15 NS4B-NS5A-NS5B-COOH. As shown in Figure 1, proteolytic processing is mediated by 
host signal peptidase and two HCV -encoded proteinases, the NS2-3 autoproteinase and the 
NS3-4A serine proteinase [see Rice, In "Fields Virology" (B. N. Fields, D. M. Knipe and P. 
M. Howley, Eds.), Vol. pp. 931-960. Raven Press, New York (1996); Shimotohno et al, J. 
Hepatol. 22: 87-92 (1995) for reviews]. C is a basic protein that serves as the viral core or 

20 capsid protein; El and E2 are virion envelope glycoproteins; p7 is a hydrophobic protein of 
unknown function that is inefficiently cleaved from the E2 glycoprotein [Lin et al, (1994a) 
supra; Mizushima et al, J. Virol. 68: 6215-6222 (1994); Selby et al, Virology 204: 1 14-122 
(1994)]. NS2-NS5B are nonstructural (NS) proteins which function in viral RNA replication 
complexes. Their functions have been identified as follows: NS2 is a metalloprotease; NS3 is 

25 a protease/helicase that contains motifs characteristic of RNA helicases and that has been 

shown to possess an RNA-stimulated NTPase activity [Suzich et al, J. Virol. 67, 6152-6158 
(1993)]; NS4A is a co-factor for NS3; NS4B is of unknown function; NS5A interacts with 
cellular factors to transcriptionally modulate cellular genes and promote cell growth [Ghosh et 
al., J. Biol Chem. 275:7184-7188] and provide IFNa resistance; and NS5B is a replicase that 

30 contains the GDD motif characteristic of the RNA-dependent RNA polymerases of other 
positive-strand RNA viruses. 

Virion assembly and release. This process has not been examined directly, but the 
lack of complex glycans, the ER localization of expressed HCV glycoproteins [Dubuisson et 
al, J. Virol 68: 6147-6160 (1994); Ralston et al, J. Virol 67: 6753-6761 (1993)] and the 

35 absence of these proteins on the cell surface [Dubuisson et al, (1994) supra; Spaete et al, 
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Virology 188: 819-830 (1992)] suggest that initial virion morphogenesis may occur by 
budding into intracellular vesicles. Thus far, efficient particle formation and release has not 
been observed in transient expression assays, suggesting that essential viral or host factors are 
absent or blocked. HCV virion formation and release may be inefficient, since a substantial 
5 fraction of the virus remains cell-associated, as found for the pestiviruses. Extracellular HCV 
particles partially purified from human plasma contain complex N-linked glycans, although 
these carbohydrate moieties were not shown to be specifically associated with El or E2 [Sato 
et al, Virology 196: 354-357 (1993)]. Complex glycans associated with glycoproteins on 
released virions would suggest transit through the trans-Golgi and movement of virions 

10 through the host secretory pathway. If this is correct, intracellular sequestration of HCV 
glycoproteins and virion formation might then play a role in the establishment of chronic 
infections by minimizing immune surveillance and preventing lysis of virus-infected cells via 
antibody and complement. 

Genetic variability. As for all positive-strand RNA viruses, the RNA-dependent 

15 RNA polymerase of HCV (NS5B) is believed to lack a 3'-5' exonuclease proofreading 

activity for removal of misincorporated bases. Replication is therefore error-prone, leading to 
a "quasi-species" virus population consisting of a large number of variants jMartell et al, J. 
Virol. 66: 3225-3229 (1992); Martell et al, J. Virol 68: 3425-3436 (1994)]. This variability 
is apparent at multiple levels. First, in a chronically infected individual, changes in the virus 

20 population occur over time [Ogata et al, (1991) supra; Okamoto et al, Virology 190: 
894-899 (1992)]; and these changes may have important consequences for disease. A 
particularly interesting example is the N-terrninal 30 residue segment of the E2 glycoprotein, 
which exhibits a much higher degree of variability than the rest of the polyprotein [for 
examples, see Higashi et al, Virology 197, 659-668. 1993; Hijikata et al, (1991) supra; 

25 Weiner et al, (1991) supra]. There is accumulating evidence that this hypervariable region, 
called hypervariable region 1 (HVR1), perhaps analogous to the V3 domain of HIV-1 gpl20, 
may be under immune selection by circulating HCV-specific antibodies [Kato et al, (1993) 
supra; Taniguchi et al, Virology 195: 297-301 (1993); Weiner et al, (1992) supra. In this 
model, antibodies directed against this portion of E2 may contribute to virus neutralization 

30 and thus drive the selection of variants with substitutions that permit escape from 
neutralization. This plasticity suggests that a specific amino acid sequence in the E2 
hypervariable region is not essential for other functions of the protein such as virion 
attachment, penetration, or assembly. Genetic evolution of HVR1 within the first 4 months of 
infection has been correlated with the ability of a particular strain of the virus to cause chronic 

35 infection [Farci et al., Science 255:339-344 (2000)]. 
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Genetic variability may also contribute to the spectrum of different responses 
observed after IFN-a treatment of chronically infected patients. Diminished serum ALT 
levels and improved liver histology, which usually correlates with a decrease in the level of 
circulating HCV RNA, is seen in ~40% of those treated [Greiser-Wilke et al, J. Gen. Virol. 
72: 2015-2019 (1991)]. After treatment, approximately 70% of the responders relapse. In 
some cases, after a transient loss of circulating viral RNA, renewed viremia is observed 
during or after the course of treatment. While this might suggest the existence or generation 
of EFN-resistant HCV genotypes or variants, further work is needed to determine the relative 
contributions of virus genotype and host-specific differences in immune response. 

Sequence comparisons of different HCV isolates around the world have also revealed 
enormous genetic diversity [reviewed in Bukh et al, (1995) supra]. Because of the lack of 
biologically relevant serological assays such as cross-neutralization tests, HCV types 
(designated by numbers), subtypes (designated by letters), and isolates are currently grouped 
on the basis of nucleotide or amino acid sequence similarity. Worldwide, HCV has been 
classified into six major genotypes and more than 50 subtypes [Purcell, Hepatology 26: 1 1 S- 
14S (1997)]. Those of greatest importance in the U.S. are genotype 1, subtypes la and lb 
(see below and Bukh et al, (1995) supra for a discussion of genotype prevalence and 
distribution). Amino acid sequence similarity between the most divergent genotypes can be a 
little as ~50%, depending upon the protein being compared. This diversity has important 
biological implications, particularly for diagnosis, vaccine design, and therapy. 

HCV RNA replication. By analogy with other flaviviruses, replication of the positive- 
sense HCV virion RNA is thought to occur via a minus-strand intermediate. This strategy can 
be described briefly as follows: (i) uncoating of the mcoming virus particle releases the 
genomic plus-strand, which is translated to produce a single long polyprotein that is probably 
processed co- and post-translationally to produce individual structural and nonstructural 
proteins; (ii) the nonstructural proteins form a replication complex that utilizes the virion 
RNA as template for the synthesis of minus strands; (iii) these minus strands in turn serve as 
templates for synthesis of plus strands, which can be used for additional translation of viral 
protein, minus strand synthesis, or packaging into progeny virions. Very few details about 
HCV replication process are available, due to the lack of a good experimental system for virus 
propagation. Detailed analyses of authentic HCV replication and other steps in the viral life 
cycle would be greatly facilitated by the development of an efficient system for HCV 
replication in cell culture. 

Many attempts have been made to infect cultured cells with serum collected from 
HCV-infected individuals, and low levels of replication have been reported in a number of 
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cells types infected by this method, including B-cell [Bertolini et al, Res. Virol 144: 
281-285 (1993); Nakajima etal, J. Virol. 70: 9925-9 (1996); Valli et al, Res. Virol. 146:285- 
288 (1995)]. T-cell (Kato etal, Biochem. Biophys. Res. Commun. 205:863-9 (1996); 
Mizutani et al, Biochem. Biophys. Res. Comm. 227:822-826; Mizutani et al, J. Virol. 70: 
5 7219-7223 (1996); Nakajima et al, (1996) supra; Shimizu and Yoshikura, J Virol, 68: 8406- 
8408 (1994); Shimizu et al., Proc. Natl. Acad. Sci USA, 89: 5477-5481 (1992); Shimizu et al, 
Proc. Natl Acad. Sci. USA, 90: 6037-6041 (1993)], and hepatocyte [Kato et al., Jpn. J. 
Cancer Res., 87: 787-92 (1996); Tagawa, J. Gastoenterol and Hepatol, 10: 523-527 (1995)] 
cell lines, as well as peripheral blood monocular cells (PBMCs) [Cribier et al., J. Gen. Virol, 

10 76: 2485-2491 (1995)], and primary cultures of human fetal hepatocytes [Carloni et al., Arch. 
Virol. Suppl. 8: 31-39 (1993); Cribier et al., (1995) supra; Iacovacci et al., Res. Virol, 144: 
275-279 (1993)] or hepatocytes from adult chimpanzees [Lanford et al., Virology 202: 606-14 
(1994)]. HCV replication has also been detected in primary hepatocytes derived from a 
human HCV patient that were infected with the virus in vivo prior to cultivation [Ito et al., J. 

15 Gen. Virol 77: 1043-1054 (1996)] and in the human hepatoma cell line Huh7 following 

transfection with RNA transcribed in vitro from an HCV-1 cDNA clone [Yoo et al., J. Virol, 
69: 32-38 (1995)]. The reported observation of replication in cells transfected with RNA 
derived from the HCV-1 clone was puzzling, since this clone lacks the required terminal 
3'NTR sequence downstream of the homopolymer tract (see below), and because a number of 

20 unusual observations were reported (see the background section of U.S. Patent Application 

No. 08/81 1,566 (Now U.S. Patent No. )). The most well-characterized cell-culture 

systems for HCV replication utilize a B-cell line (Daudi) or T-cell lines persistently infected 
with retroviruses (HPB-Ma or MT-2) [Kato et al., (1995) supra; Mizutani et al., Biochem 
Biophys Res. Comm., 227: 822-826 (1996a); Mizutani et al., (1996) supra; Nakajima et al., 

25 (1996) supra; Shimizu and Yoshikura, (1994) supra]; Shimizu, Proc. Natl. Acad. Sci. USA, 
90: 6037-6041 (1993)]. HPBMa is infected with an amphotropic murine leukemia virus 
pseudorype of murine sarcoma virus, while MT-2 is infected with human T-cell lymphotropic 
virus type I (HTLV-I). Clones (HPBMalO-2 and MT-2C) that support HCV replication more 
efficiently than the uncloned population have been isolated for the two T-cell lines HPBMa 

30 and MT-2 [Mizutani et al. J. Virol. (1996) supra; Shimizu et al., (1993) supra]. However, the 
maximum levels of RNA replication obtained in Ihese lines or in the Daudi lines after 
degradation of the input RNA is still only about 5 x 10 4 RNA molecules per 10 6 cells 
[Mizutani et al., (1996) supra; Mizutani et al., (1996) supra] or 10 4 RNA molecules per ml of 
culture medium [Nakajima et al., (1996) supra]. Although the level of replication is low, 

35 long-term infections of up to 198 days in one system [Mizutani et al., Biochem. Biophys. Res. 
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Comm. 227: 822-826 (1996a)] and more than a year in another system [Nakajima et al., 
(1996) supra] have been documented, and infectious virus production has been demonstrated 
by serial cell-free or cell-mediated passage of the virus to naive cells. 

However, efficient replication of an HCV clone comprising the essential conserved 
5 terminal 3' NTR sequence had not been observed until the work described in co-pending 

application 08/81 1,566, now U.S. Patent No. , also reported in Kolykhalov et al., 

Science 277:570 (1997), which describes an infectious clone of an isolate of the H strain (type 
la). HCV clones of other subtypes are now known. See, e.g., Yanagi et al., Virology 
262:250-263 (1999) and Yanagi et al., Virology 244:161-172 (1998). While RNA transcripts 

10 of these clones are able to infect chimpanzees, cell cultures with these clones only support 
replication of the virus poorly if at all. 

As described in U.S. Patent Application No. 08/81 1,566 (Now U.S. Patent No. ) 

(see, e.g., Figure 2 therein) many variations of a functional clone are possible. These include 
full length or partial sequences where a foreign gene is inserted. The foreign gene can 

15 include, e.g., a reporter gene such as (3-galactosidase or luciferase, or a gene encoding a 

selectable marker such as neo, DHFR, or ik. In a specific example disclosed therein, the neo 
gene is operably linked to an internal ribosome entry site (IRES), in order for infected cells to 
be selected by neomycin or G418 resistance. In this way, presence of replicating HCV RNA 
in essentially all surviving cells is assured. Additionally, the HCV polyprotein coding region 

20 of these clones can be deficient in some or all of the structural genes C, El and E2. Thus, 

replicons can be created without the production of virions. By combining the structural gene- 
deficient construct with a selectable marker such as neo, an efficiently replicating replicon 
system can be created that can be used to study HCV replication and for other purposes. 

Examples of the replicons disclosed in U.S. Patent Application No. 08/811,566 (Now 

25 U.S. Patent No. ) is provided in Lohmarm et al., Science 285: 110-113 (1999). In that 

work, DNA clones of HCV replicons of genotype 1, subtype lb were constructed. Features 
of those replicons that are not wild-type HCV features are: a polyprotein coding region 
lacking the genes encoding the HCV structural proteins; an EMCV IRES immediately 5' to 
the polyprotein region; and a neo gene immediately 3' to the 5' NTR (and the HCV IRES), 

30 where the 5' end of the HCV C protein gene is fused to the 5' end of the neo gene. When 
Huh-7 cells were transfected with RNA transcripts of these clones, 6 to >60 G418-resistant 
colonies arose per experiment. Although the number of cells treated was not specified, about 
10 6 - 10 7 cells are normally treated in experiments of this type. Therefore, it is believed that 
the transfection efficiency, as measured by G418-resistant colonies/total treated, was less than 

35 .01% in those studies. 
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Controls in the Lohmann et al. work included in-frame deletions of the active site of 
the NS5B polymerase. Although care was taken to remove template DNA from the control 
transcripts, several G418-resistant control colonies arose. Still, the number of G418-resistant 
control colonies that arose was much less than the colonies arising from the cells transfected 
5 with the replicons containing the wild-type NS5B. 

When the G418-resistant colonies were subpassaged, most could not be maintained. 
Out of more than 303 G418-resistant colonies from non-control replicon treatments, 9 (<3%) 
could be subpassaged to establish stable cell lines. Replicons established in infected cell lines 
were sequenced. Although each replicon had a number of amino acid substitutions, the 

10 substitutions were scattered throughout the polyprotein coding region. Therefore, there were 
no mutations that were consistently in one area of the polyprotein coding region, and it was 
concluded that the establishment of the nine cell lines was not due to adaptive mutations in 
those replicons. This contention was experimentally tested by transfection/reconstitution 
experiments that did not provide evidence for adaptive changes. 

15 Despite the advances described above, more efficient HCV-infected cell systems are 

needed for the production of concentrated virus stocks, structural analysis of virion 
components, evaluation of putative antiviral therapies including vaccines and antiviral 
compounds, and improved analyses of intracellular viral processes, including RNA 
replication. Thus, there is a need for various types of HCV clones that can be used for any of 

20 the above purposes. There is also a need to characterize HGV with respect to regions of the 
genome that might contribute to more efficient in vitro or in vivo replication and virion 
production. 

Summary of the Invention 
25 Thus, a primary object of the present invention has been to provide DNA encoding 

non-naturally occurring HCV that is capable of replication. 

A related object of the invention is to provide genomic RNA from the above DNA. 
Still another object of the invention is to provide attenuated HCV DNA or genomic RNA 
suitable for vaccine development, which can invade a cell and replicate but cannot propagate 
30 infectious virus. 

Another object of the invention is to provide in vitro and in vivo models of HCV 
infection and RNA replication for testing anti-HCV (or antiviral) drugs, for evaluating drug 
resistance, and for testing attenuated HCV viral vaccines. 
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An additional object of the invention is to provide replicating HCV replicons. These 
replicons do not encode structural proteins but may encode a foreign protein such as a 
reporter gene or a selectable marker. 

Still another object of the invention is to provide adaptive replicons, with increased 
5 ability to establish replication in continuous or primary cell lines. 

Briefly, therefore, the inventors have succeeded in discovering methods of creating 
replicating HCV variants, including variants with adaptive mutations in HCV that improve 
their ability to establish RNA replication in culture to create continuous cell lines. These 
HCV variants and the cell lines that harbor them are useful for studying replication and other 
10 HCV characteristics. The cell lines are also useful for developing vaccines and for testing 
compounds for antiviral properties. 

Thus, in some embodiments, the present invention is directed to a polynucleotide 
comprising a non-naturally occurring HCV sequence that is capable of productive replication 
in a host cell, or is capable of being transcribed into a non-naturally occurring HCV sequence 
that is capable of productive replication in a host cell. The HCV sequence comprises, from 5' 
to 3' on the positive-sense nucleic acid, a functional 5' non-translated region (5* NTR); one or 
more protein coding regions, including at least one polyprotein coding region that is capable 
of replicating HCV RNA; and a functional HCV 3' non-translated region (3' NTR). In 
preferred embodiments of these polynucleotides, the 5* NTR is an HCV 5' NTR, the 
polynucleotide comprises at least one IRES selected from the group consisting of a viral 
IRES, a cellular IRES, and an artificial IRES, and the polyprotein coding region is an HCV 
polyprotein coding region. 

In certain aspects of these embodiments, the above polynucleotides further comprise 
an adaptive mutation. The adaptive mutation can be such that the polynucleotide has a 
transfection efficiency into mammalian cells of greater than 0.01%; more preferably greater 
than 0.1%; even more preferably, greater than 1%; still more preferably greater than 5%, may 
be about 6%. The adaptive mutations can be such that the polynucleotide is capable of 
replication in a non-hepatic cell, for example HeLa cells. The adaptive mutations can also 
cause the polynucleotide to have attenuated virulence, wherein the HCV is impaired in its 
ability to cause disease, establish chronic infections, trigger autoimmune responses, and 
transform cells. 

In some embodiments of the above described adaptive mutants, the polyprotein 
region comprises an NS5A gene that is not a wild-type NS5A gene. Preferably, the NS5 A 
gene comprises a mutation. The mutation is preferably within 50 nucleotides of an ISDR or 
includes the ISDR; more preferably the mutati9on is within 20 nt of the ISDR, or includes the 
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ISDR. Examples of these adaptive mutations are those that encode an amino acid sequence 
change selected from the group consisting of Ser (1 179) to He, Arg (1 164) to Gly, Ala(l 174) 
to Ser, Ser(l 172) to Cys, and Ser(l 172) to Pro of SEQ ID NO:3. Other adaptive mutations 
include a deletion of at least a portion of the ISDR, and may comprise the entire ISDR. In a 
5 particular embodiment, the adaptive mutation comprises a deletion of nucleotides 5345 to 
5485ofSEQIDNO:6. 

In some embodiments of the invention polynucleotides, the HCV polyprotein coding 
region encodes all HCV structural and nonstructural proteins. In other embodiments, the 
polyprotein coding region is incapable of making infectious HCV particles, making the HCV 

10 variant a replicon. Preferably the inability to make HCV particles is due to a deletion in the 
structural protein coding region. Some embodiments of these replicons further comprise a 
foreign gene operably linked to a first IRES and the HCV polyprotein coding region operably 
linked to a second IRES. Preferably, the replicon comprises a genotype 1 HCV sequence, 
most preferably subtype lb. Preferred foreign genes in these replicons are selectable markers 

15 or reporter genes. In other preferred replicon embodiments, the first IRES is an HCV IRES, 
the foreign gene is a neo gene, and the second IRES is a EMCV IRES. Examples of the 
above replicons include SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:22 and SEQ ID NO:25. 
The above replicons also preferably comprise an adaptive mutation, including any of the 
adaptive phenotypes previously described, including increased transfection efficiency, 

20 replication in a non-hepatic cell including HeLa cells, and attenuated virulence, and further 
comprising any of the adaptive mutations previously described, such as the various NS5A 
mutations and deletions previously described. 

The polynucleotides of the present invention can be in the form of RNA or DNA. 
Preferred embodiments of the polynucleotides are SEQ ID NOs:5-13 and 22-25, the 

25 complements thereof, and the RNA equivalents of the sequences or their complements. In 

certain embodiments, the polynucleotides are capable of productive infection in a chimpanzee 
upon intrahepatic injection. 

The present invention is also directed to expression vectors comprising DNA forms of 
any of the above polynucleotides, operably associated with a promoter. Additionally, the 

30 invention is directed to cells comprising the above expression vectors as well as host cells 
comprising any of the polynucleotides described above. The host cells are preferably 
mammalian cells, more preferably human cells. The host cells are preferably hepatocytes, T- 
cells, B-cells, or foreskin fibroblasts; most preferably hepatocytes. Certain adaptive mutants 
can also replicate in HeLa cells. The host cells can be within a non-human mammal capable 
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of supporting transfection and replication of the HCV KNA, and infection when the HCV 
RNA encodes a virus particle. A preferred non-human mammal is a chimpanzee. 

In additional embodiments, the present invention is directed to methods for 
identifying a cell line mat is permissive for KNA replication with HCV. The method includes 
5 the steps of contacting a cell in tissue culture with an infectious amount of the above- 
described polynucleotides, and detecting replication of HCV variants in cells of the cell line. 

The present invention is also directed to a method for producing a cell line 
comprising replicating HCV. The method includes the steps of (a) transcribing the above- 
described expression vector to synthesize HCV KNA; (b) transfecting a cell with the HCV 
10 RNA; and (c) culturing the cell. 

Additionally, the present invention is directed to a vaccine. The vaccine includes any 
of the above-described polynucleotides, in a pharmaceutically. acceptable carrier. In related 
embodiments, the present invention is directed to a method of inducing immunoprotection to 
HCV in a primate. The method includes administering the vaccine to the primate. 
15 In further embodiments, the present invention is directed to a method of testing a 

compound for inhibiting HCV replication. The method includes the steps of (a) treating the 
above described host cells with the compound; and (b) evaluating the treated host cell for 
reduced replication, wherein reduced HCV replication indicates the ability of the compound 
to inhibit replication. 

20 In additional embodiments, the present invention is directed to a method of testing a 

compound for inhibiting HCV infection. The method comprises treating a host cell with the 
compound before, during or after infecting the host cell with any of the invention 
polynucleotides. 

In still other embodiments, the present invention is directed to an HCV variant that 
25 has (a) transfection efficiency greater than .0.01%, as determined by replication-dependent 
neomycin resistance, or (b) greater ability of initial colonies of cells transfected with the 
variant to survive subpassage than wild-type HCV genotype 1, subtype lb. The HCV variant 
also has, from 5' to 3' on the positive-sense nucleic acid, a functional HCV 5' non-translated 
region (5 r NTR) comprising an extreme 5'-terminal conserved sequence; an HCVpolyprotein 
30 coding region; and a functional HCV 3' non-translated region (3*NTR) comprising a variable 
region, a polypyrimidine region, and an extreme 3'-terminal conserved sequence. In preferred 
embodiments, the transfection efficiency is greater than 0.1%; in more preferred 
embodiments, greater than 1%; in still more preferred embodiments, greater than 5%. Li the 
most preferred embodiments, the transfection efficiency is about 6%. 
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The variants can have any of the characteristics of the polynucleotides described 
above. However, preferred variants comprise the NS5A mutation or deletion described for 
the polynucleotides above. 

Among the several advantages achieved by the present invention are the provision of 
5 polynucleotides comprising non-naturally occurring HCV sequences; the provision of HCV 
variants that have a transfection efficiency and ability to survive subpassage greater than 
HCV forms that have wild-type polyprotein coding regions; the provision of expression 
vectors comprising the above polynucleotides and HCV variants; the provision of cells and 
host cells comprising the above expression vectors, the provision of methods for identifying a 
10 cell line that is permissive for RNA replication with HCV; the provision of vaccines 

comprising the above polynucleotides in a pharmaceutically acceptable carrier; the provision 
of methods for inducing immunoprotection to HCV in a primate; and the provision of 
methods for testing a compound for inhibiting HCV replication. 

15 Brief Description of the Drawings 

FIGURE 1. HCV genome structure, polyprotein processing, and protein features. At the top 
is depicted the viral genome with the structural and nonstructural protein coding regions, and 
the 5'and 3' NTRs, and the putative 3' secondary structure. Boxes below the genome indicate 
proteins generated by the proteolytic processing cascade. Putative structural proteins are 

20 indicated by shaded boxes and the nonstractural proteins by open boxes. Contiguous 

stretches of uncharged amino acids are shown by black bars. Asterisks denote proteins with 
N-linked glycans but do not necessarily indicate the position or number of sites utilized. 
Cleavage sites shown are for host signalase (♦), the NS2-3 proteinase (curved arrow), an the 
NS3-4A serine protease (% 

25 

FIGURE 2. Strategies for expression of heterologous RNAs and proteins using HCV vectors. 
At the top is a diagram of the positive-polarity RNA virus HCV, which expresses mature viral 
proteins by translation of a single long ORF and proteolytic processing. The regions of the 
polyprotein encoding the structural proteins (STRUCTURAL) and the nonstructural proteins 

30 (REPLICASE) are indicated as lightly-shaded and open boxes, respectively. Below are 
shown a number of proposed replication-competent "replicon" expression constructs. The 
first four constructs (A-D) lack structural genes and would therefore require a helper system 
to enable packaging into infectious virions. Constructs E-G would not require helper 
functions for replication or packaging. Darkly shaded boxes indicate heterologous or foreign 

35 gene sequences (FG). Translation initiation (aug) and termination signals (trm) are indicated 
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by open triangles and solid diamonds, respectively. Internal ribosomes entry sites (IRES) are 
shown as boxes with vertical stripes. Constructs A and H illustrate the expression of a 
heterologous product as an in-frame fusion with the HCV polyprotein. Such protein fusion 
junctions can be engineered such that processing is mediated either by host or viral 
5 proteinases (indicated by the arrow). 

FIGURE 3. Structure ofHCVreplbBartMan. Two versions of this infectious replicon were 
constructed as described in Example 1. The first, HCVreplbBartMan/Avall, has & Avail 
restriction site in the variable domain of the 3' NTR that is not present in the 3* NTR of wild- 
type HCV subtype lb. The second variant, HCVreplbBartMan/A2U's, has 32, rather than the 
wild-type 34, LPs in the longest stretch of contiguous IPs in the polypyrimidine domain of the 
3' NTR. The "GDD— ►AGG" designation shows the inactivating mutation in the non- 
replicating replicons that were used as polymerase-minus controls in Example 1. 

FIGURE 4. Generation of G418-resistant cell clones. At the top is a diagram of the 
HCVreplbBartMan replicons as described in Figure 3. The middle text summarizes the steps 
used to isolate the adaptive mutants, which are further described in Example 1. The bottom 
chart summarizes several characteristics of some of the replicons isolated as described in the 
Example. 

FIGURE 5. Synthesis of HCV-specificRNA and proteins. Figure 5A illustrates actinomycin 
D-resistant KNA replication of four adaptive replicons as further described in the Example. 
Figure 5B illustrates the immunoprecipitation of 35 S-labeled HCV-specific proteins of three 
adaptive replicons as further described in Example 1. 

FIGURE 6. Detection ofNS3 in G418-resistant cell clones. Monolayers of cells transfected 
with various replicons as indicated were immunostained with an anti-NS3 antibody. Patterns 
of staining were similar to cells stained from an infected liver. 

FIGURE 7. Nucleotide and amino acid changes in the NS5A coding region of HCV. 
Nucleotide and amino acid changes in a portion of the NS5 A coding region of seven adaptive 
clones are indicated. 

FIGURE 8. G418~resistant colonies generated after electroporation of replicon RNAs into 
Huh7 cells. The ability of an adaptive replicon (Replicon I) to establish colonies after 
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transfection into Huh7 cells. (middle) is compared to the original replicon 
HCVrepBartMan/Avall (left) and the same adaptive replicon, but with an inactivating 
mutation in the polymerase gene (right). 

5 FIGURE 9. Structures of HCVreplicons and fiill-lengthHCVRNAs. The adaptive replicon 
5'NTR-EMCV has the S'NIR fused directly to the EMCV IRES upstream of NS3. Another 
adaptive replicon, HCVrep/NS2-5B has the non-structural protein, NS2, upstream of NS3. A 
full-length HCV cDNA clone, HCV FL, was assembled. Also, a bicistronic derivative, HCV 
FL-neo, was assembled where the SWlK is fused to the neomycin phosphotransferase gene 
10 and the EMCV IRES is upstream of the HCV open reading frame. In both full-length clones, 
the open reading frame comprises the structural and non-structural regions, from capsid to 
NS5B. In addition, all of the replicons and full-length HCV RNAs comprise the mutation 
coding for Ser to lie substitution at position 1 179 of SEQ ID NO:3, in NS5A. 

15 FIGURE 10. RNA replication of replicons and full-length HCV RNAs. The HCV replicons . 
and full-length HCV RNAs shown in FIGURE 9 are replication competent. 

Detailed Description of the Invention 

Definitions 

20 Various terms are used herein, which have the following definitions: 

As used herein, "HCV polyprotein coding region" means the portion of a hepatitis C 
virus that codes for the polyprotein open reading frame (ORF). This ORF may encode 
proteins that are the same or different than wild-type HCV proteins. The ORF may also 
encode only some of the functional proteins encoded by a wild-type polyprotein coding 

25 region. The proteins encoded therein may also be from different isolates of HCV, and non- 
HCV proteins may also be encoded therein. 

The phrase "pharmaceutically acceptable" refers to molecular entities and 
compositions that are physiologically tolerable and do not typically produce an allergic or 
similar untoward reaction, such as gastric upset, dizziness and the like, when administered to 

30 a human. Preferably, as used herein, the term "pharmaceutically acceptable" means approved 
by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopoeia or other generally recognized pharmacopoeia for use in animals, and more 
particularly in humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle 
with which the compound is aekrunistered. Such pharmaceutical carriers can be sterile 

35 liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic 
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origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Water or aqueous 
solution saline solutions and aqueous dextrose and glycerol solutions are preferably employed 
as carriers, particularly for injectable solutions. Suitable pharmaceutical carriers are 
described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 

5 The phrase "therapeutically effective amount" is used herein to mean an amount 

sufficient to reduce by at least about 15 percent, preferably by at least 50 percent, more 
preferably by at least 90 percent, and most preferably prevent, a clinically significant deficit 
in the activity, function and response of the host. Alternatively, a therapeutically effective 
amount is sufficient to cause an improvement in a clinically significant condition in the host. 

10 The term "adjuvant" refers to a compound or mixture that enhances the immune 

response to an antigen. An adjuvant can serve as a tissue depot that slowly releases the 
antigen and also as a lymphoid system activator that non-specifically enhances the immune 
response (Hood et al., Immunology, Second Ed., 1984, Benjamin/Cummings: Menlo Park, 
California, p. 384). Often, a primary challenge with an antigen alone, in the absence of an 

15 adjuvant, will fail to elicit a humoral or cellular immune response. Adjuvants include, but are 
not limited to, complete Freund's adjuvant, incomplete Freund's adjuvant, saponin, mineral 
gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil or hydrocarbon emulsions, keyhole limpet hemocyanins, 
dinitrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette-Gueriri) 

20 and Corynebacterium parvum. Preferably, the adjuvant is pharmaceutically acceptable. 

In a specific embodiment, the term "about" or "approximately" means within 20%, 
preferably within 10%, and more preferably within 5% of a given value or range. 

Hie term "virus infection" as used herein, refers to the usual way that wild-type virus 
particles become established in host cells. This generally includes binding to the host cell, 

25 uptake, delivery to the cytosol or nucleus, and initiation of replication. 

The term "transfection" as used herein, refers to the infection of a cell with a 
polynucleotide. The polynucleotide can be DNA or RNA. A preferred method of 
transfecting a cell with an HCV polynucleotide is with replication competent RNA. Delivery 
to permissive cells can be facilitated by electroporation, charged liposomes, high salt, DE 

30 dextran, etc. Replication competent RNAs can also be launched in cells after transfection of 
DNA such as plasmids or DNA viruses that have been appropriately engineered to provide 
transcription initiation and termination signals. The transfected RNAs can represent full- 
length genome RNAs capable of initating a complete replication cycle (including production 
of progeny virus), or they may be defective lacking one or more RNA elements or proteins 

35 essential for virion production but not RNA replication. The latter RNAs, which are lacking 
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in the ability to produce a virion, will be referred to generally herein as "replication competent 
KNAs", "RNATeplicons" or "replicons". 

As used herein, the term "subpassage" connotes the transfer of a colony from one 
vessel of media to another vessel of media. Examples of vessels of media include dishes, 
5 bottles or test tubes with solid or liquid growth media. Unless otherwise indicated, 

"subpassage" means the transfer of a colony of HCV-transfected cells from a vessel of media 
where the newly transfected cells were plated to a vessel of media where the colony is 
isolated. 

The term "authentic" is used herein to refer to an HCV polynucleotide, whether a 

10 DNA or RNA, that provides for replication and production of functional HCV proteins, or 

components thereof. The authentic HCV polynucleotides of the present invention are capable 
of replication and may be infectious, e.g., in a chimpanzee model or in tissue culture, to form 
viral particles (i.e., "virions"). An authentic HCV polynucleotide of the present invention 
may also be a "replicon", such that it is incapable of producing the full complement of 

15 structural proteins to make a replication competent infectious virion. However, such 
replicons are capable of RNA replication. Thus, the authentic HCV polynucleotides 
exemplified in the present application contains all of the virus-encoded information, whether 
in RNA elements or encoded proteins, necessary for initiation of an HCV RNA replication 
cycle. The authentic HCV polynucleotides of the invention include modifications described 

20 herein, e.g. , by site-directed mutagenesis or by culture adaptation, producing a defective or 
attenuated derivative, or an adaptive variant Alternatively, sequences from other genotypes 
or isolates can be substituted for the homologous sequence of the specific embodiments 
described herein. For example, an authentic HCV nucleic acid of the invention may comprise 
the adaptive mutations disclosed herein, e.g., on a recipient plasmid, engineered into the 

25 polyprotein coding region of a functional clone from another isolate or genotype (either a 
consensus region or one obtained by very high fidelity cloning). In addition, the HCV 
polynucleotide of the present invention can include a foreign gene, such as a gene encoding a 
selectable marker or a reporter protein. 

30 General Description 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of cell culture, molecular biology, microbiology, recombinant DNA, 
and immunology, which are within the skill of the art. Such techniques are explained fully in 
the literature. See, e.g., Ausubel et al. (ed.) (1993) "Current protocols in molecular biology. 

35 Green Publishing Associates, New York; Ausubel et al. (1995), "Short Protocols in Molecular 
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Biology", John Wiley and Sons; Joseph Sambrook et al. (1989), "Molecular Cloning, A 
Laboratory Manual", second ed., Cold Spring Harbor Laboratory Press; the series, 
METHODS IN ENZYMOLOGY (Academic Press, Inc.); Animal Cell Culture [R.I. 
Freshney, ed. (1986)]; Lau, ed. (1999), HEPATITIS C PROTOCOLS, Humana Press, 
5 New York; and Immobilized Cells And Enzymes [SSL Press, (1986)]; all of which are 
incorporated by reference. 

The present invention is directed to variants of hepatitis C virus (HCV) and methods 
for producing the variants. As used herein, an HCV variant is a non-naturally occurring HCV 
sequence that is capable of productive replication in a host cell. The genetic sequence of 

10 these variants may comprise insertions, deletions, or base mutations from wild type HCV 
sequences. As further discussed infra, the variants may be produced by genetic engineering, 
by methods known to the skilled artisan (see, e.g., U.S. Patent Application No. 08/81 1,566 

(Now U.S. Patent No. ); Lohmann et al., Science 285: 110-1 13(1999)). Alternatively, as 

further discussed infra, the variants may also be produced by culture selection methods, or a 

1 5 combination of culture selection and genetic engineering. 

The variants are in the form of DNA or RNA and can be" incorporated into any useful 
form of those compounds, for example in extrachromosomal DNA that replicates in a 
microorganism such as E. coli or yeast Included among these are plasmids, phage, BACs, 
YACs, etc. RNA and virions comprising the variant are also envisioned as within the scope 

20 of the invention. The variants of the present invention can also be in the form of cassettes for 
insertion into a DNA cloning vector. The HCV RNAs are envisioned to be complementary to 
any HCV DNA disclosed herein. An infectious HCV RNA is a positive strand RNA created 
from the negative strand template of the HCV DNA clone of the invention. 

The variants of the present invention are not narrowly limited to any particular virus 

25 subtype. Thus, any particular component of the variant, or the entire variant, may be from 
any HCV subtype. Preferred subtypes are la and lb, due to the widespread occurrence, as 
well as the large amount of knowledge available for those two subtypes. However, the use of 
any other genotype or subtype, as would be considered within the skill of the art, is 
envisioned as within the scope of the invention. These subtypes include, but are not limited 

30 to, any subtypes within genotypes HCV-1, HCV-2, HCV-3, HCV-4, HCV-5, and HCV-6. 
Moreover, since HCV lacks proofreading activity, the virus itself readily mutates, forming 
mutant "quasi-species" of HCV that are also contemplated as useful for the present invention. 
Such mutations are easily identified by sequencing isolates from a subject, as detailed herein 
or in U.S. Patent Application No. 08/8 1 1,566 (Now U.S. Patent No. ). It would be 

35 expected that the methods and compositions disclosed herein are useful for any known 
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subtype or quasi-species, or any subtype or quasi-species not now known but that is 
discovered in the future. 

The HCV variants of the invention include a 5-NTR conserved sequence, which 
generally comprises the 5'-terminal sequence GCCAGCC, and which may have additional 
5 bases upstream of this conserved sequence without affecting functional activity of the HCV 
nucleic acid. In a preferred embodiment, the 5-GCCAGCC includes from 0 to about 10 
additional upstream bases; more preferably it includes from 0 to about 5 upstream bases; more 
preferably still it includes 0, one, or two upstream bases. In specific embodiments, the 
extreme 5'-terminal sequence may be GCCAGCC; GGCCAGCC; UGCCAGCC; 

10 AGCCAGCC; AAGCCAGCC; GAGCCAGCC; GUGCCAGCC; or GCGCCAGCC, wherein 
the sequence GCCAGCC is the 5'-terminus of SEQ ID NO: 1. However, the scope of the 
HCV variants of the invention encompasses any functional HCV 5' NTR, whether now 
known or later discovered. 

The HCV variants of the invention also include a 3' NTR that comprises a poly- 

15 pyrimidine region as is known in wild-type HCV. These polypyrimidine regions are known 
to comprise, on the positive-strand HCV RNA, a poly(U)/poly(UC) tract or a poly(A) tract. 
However, the polypyrimidine region of the present invention may also include other 
polypyrimidine tracts that are not now known but are later found to be functional in infectious 
HCV. As is known in the art, the polypyrimidine tract may be of variable length: both short 

20 (about 75 bases) and long (133 bases) are effective, although an HCV clone containing a long 
poly(U/UC) tract is found to be highly infectious. Longer tracts may be found in naturally 
occurring HCV isolates. Thus, an authentic HCV nucleic acid of the invention may have a 
variable length polypyrimidine tract. 

The 3' NTR also comprises, at its extreme 3' end, the highly conserved RNA element 

25 of about 98 nucleotides known in the art, and as described in, e.g., U.S. Patent No. 5,874,565, 

U.S. Patent Application No. 08/81 1,566 (Now U.S. Patent No. ), and U.S. Patent No. 

5,837,463 . In a specific aspect, the 3'-NTR extreme terminus is RNA homologous to a DNA 
having the sequence 

S'-TGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCC 
30 GCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCTGATCATGT-3' (SEQ ID 
NO:2). However, the scope of the invention is meant to encompass HCV variants with any 
HCV 3' NTR that allows virus replication, whether the sequence is now known or later 
discovered. Included are 3' NTRs that do not comprise a variable region. 

The HCV variants of the present invention also include a polyprotein coding region 
35 sufficient to allow replication of the HCV RNA. Thus, the polyprotein coding region may be 
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deficient in functional genes encoding the full complement of the HCV structural genes C, El 
and E2. In addition, the polyprotein coding region may comprise deletions, insertions, or 
mutations that do not occur in wild-type HCV strains. Further, the polyprotein coding region 
may be chimeric, such that some of the genes encoded therein are from analogous regions of 
5 another virus, as discussed infra. 

The HCV variants encompassed by the present invention include variants that do not 
produce virus particles. These variants, which may be termed "replicons" , lack the ability to 
produce a fully functional complement of the structural proteins C, El and E2. The inability 
to produce the functional structural protein component of the HCV virus may be conferred by 
10 deletion of the genes encoding one, two, or all three of these proteins. Alternatively, a 
deletion of a small portion of the coding sequence of one of the structural proteins, or a 
mutation in a critical region of the coding sequence, or an insertion into the coding sequence 
could lead to an HCV that cannot produce virions. In the latter case, the insertion can be any 
sequence that disrupts the ability of the structural protein from becoming part of a virion, and 
1 5 can include functional sequences, such as those that encode a reporter gene (such as p- 

galactosidase) or those that confers selectability to the cell harboring the replicon (such as 
ned). The above manipulations are entirely wilhin the skill of the art See, e.g., Lohmann et 
al., supra and Example 1. As discussed infra, such variants are useful for studying replication 
of the HCV virus, among other things. 
20 The variants of the present invention can also comprise an alteration in the coding 

sequence of the polyprotein coding region that does not affect the production of functional 
virions or replicons. These alterations can be such that the amino acid sequence of the mature 
protein is not changed from the wild-type sequence, due to the degeneracy of the genetic 
code. Such alterations can be useful, e.g., when they introduce or remove a restriction site, 
25 such that the size of HCV fragments produced by digestion with a restriction enzyme is 

altered. This provides a distmguishing characteristic of that variant, which can be used, e.g., 
to identify a particular infectious isolate in a multiple infection animal model, or to provide 
convenient sites for subsequent engineering. Any technique for mutagenesis known in the art 
can be used, including but not limited to in vitro site-directed mutagenesis [Hutchinson, C, et 
30 al, 1978, J. Biol. Chem. 253:6551; Zoller and Smith, 1984, DNA 3:479-488; Oliphant et al, 
1986, Gene 44:177; Hutchinson et al, 1986, Proc. Natl. Acad. Sci. U.S.A. 83:710], use of 
TAB® linkers (Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis 
[see Higuchi, 1989, "Using PCR to Engineer DNA", in PCR Technology: Principles and 
Applications for DNA Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70]. 
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Alterations in the polyprotein coding sequence can also introduce conservative amino 
acid substitutions in the HCV-encoded proteins. Conservative amino acid substitutions refer 
to the interchangeabiliry of residues having similar side chains. Conservatively substituted 
amino acids can be grouped according to the chemical properties of their side chains. For 
5 example, one grouping of amino acids includes those amino acids have neutral and 

hydrophobic side chains (A, V, L, I, P, W, F, and M); another grouping is those amino acids 
having neutral and polar side chains (G, S, T, Y, C, N, and Q); another grouping is those 
amino acids having basic side chains (K, R, and H); another grouping is those amino acids 
having acidic side chains (D and E); another grouping is those amino acids having aliphatic 

10 side chains (G, A, V, L, and I); another grouping is those amino acids having aliphatic- 
hydroxyl side chains (S and T); another grouping is those amino acids having amine- 
containing side chains (N, Q, K, R, and H); another grouping is those amino acids having 
aromatic side chains (F, Y, and W); and another grouping is those amino acids having sulfur- 
containing side chains (C and M). Preferred conservative amino acid substitutions are: R-K; 

1 5 E-D, Y-F, L-M; V-I, and Q-H. Conservative amino acid substitutions, when conferred on the 
structural proteins, can alter antigenic epitopes, and thus the immune reactivity of the virus. 
Those substitutions could also alter the function of the non-structural proteins, such that the 
virus reproduces at a different rate or is altered in its ability to replicate in cell culture or in an 
organism. See, e.g., Example 1, where replicon IV is adaptive to cell culture conditions due 

20 to the conservative amino acid substitution Ser — > Cys in the NS5A protein. 

Alterations in the polyprotein coding region could also introduce nonconservative 
amino acid substitutions in one or more of the proteins encoded therein. Nonconservative 
substitutions would be expected to alter protein function more drastically than conservative 
substitutions, and would thus be more likely than conservative substitutions to alter 

25 phenotypic characteristics of the virus such as replication rate, adaptation to cell culture or in 
vivo culture, and displayed antigenic determinants. Examples are several adaptive mutations 
in the NS5 A coding region described in the , infra. 

In some embodiments of the invention, the polyprotein coding region has a consensus 
sequence derived from more than one HCV isolate. For example, an authentic HCV nucleic 

30 acid of the invention may comprise a 5' and 3' sequence from any one subtype of the virus and 
a polyprotein region from any other subtype. Alternatively, only one of the proteins encoded 
in the polyprotein might be from another viral subtype. ,Ih this way, the effect of a particular 
protein in conferring characteristics of a particular strain (e.g., reduced virulence, increased 
replication rate etc.) can be studied. 
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Chimeras with other viruses, such as with bovine viral diarrhea virus, or another 
flavivirus, are also envisioned. See, e.g., PCT/US99/08850, incorporated herein by reference. 
In these embodiments, components of the functional clones can be used to construct chimeric 
viruses for assay of HCV gene functions and inhibitors thereof [Filocamo et ah, J. Virol. 71: 
5 1417-1427 (1997); Hahm et al, Virology 226: 318-326 (1996); Lu and Wimmer, Proc Natl 
Acad Sci USA 93: 1412-7 (1996)]. In one such extension of the invention, functional HCV 
elements such as the 5' IRES, proteases, RNA helicase, polymerase, or 3' NTR are used to 
create chimeric derivatives of BVDV whose productive replication is dependent on one or 
more of these HCV elements. Such BVDV/HCV chimeras can then be used to screen for and 
1 0 evaluate antiviral strategies against these functional components. 

Chimeras where a gene encoding a structural or nonstructural protein from a closely 
related virus such as GB virus B replaces the corresponding HCV gene would also be 
expected to be functional. See, e.g., Butkiewicz et al., 2000, J. Virol 74,4291-4301. 

15 Other alterations in the polyprotein coding region contemplated by the present 

invention include deletions or insertions in the sequence. Such alterations may also alter 
replication rate, adaptation to various growth conditions, or antigenic determinants. A 
preferred example of a useful deletion includes the 47 amino acid deletion and replacement of 
Ser 1 182 to Asp 1229 of SEQ ID NO:3 with Tyr, which is an adaptive mutation in the NS5A 

20 that provides greater transfection efficiency than HCVs with wild-type NS5A. See Example 
1. 

Insertions into the polyprotein coding region can be of any length and into any area of 
the region, provided the modified HCV is still able to replicate. Preferably, the insertion is 
engineered in frame with the rest of the polyprotein coding region, to allow correct translation 

25 of the polyprotein region downstream from the insertion. 

Insertions into the polyprotein coding region could introduce a gene encoding a 
heterologous protein. The choice of heterologous protein is not narrowly limited and can 
include a protein that is therapeutic to the infected host or cell, or a protein that is harvested 
and purified for another purpose. Particularly useful heterologous genes include those used 

3 0 for detection of the variant (i.e., reporter genes), or for selection of cells having the variant. 
Nonlimiting examples of reporter genes useful in the present invention include P- 
galactosidase, (3-glucuronidase, firefly or bacterial luciferase, green fluorescent protein (GFP) 
and humanized derivatives thereof, cell surface markers, and secreted markers. Such products 
are either assayed directly or may activate the expression or activity of additional reporters. 

35 Nonlimiting examples of selectable markers for mammalian cells include, but are not limited 
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to, the genes encoding dihydrofolate reductase (DHFR; methotrexate resistance), thymidine 
kinase (tt, methotrexate resistance), puromycin acetyl transferase (pac; puromycin 
resistance), neomycin resistance (neo\ resistance to neomycin or G418), mycophenolic acid 
resistance (gpi), hygromycin resistance, blasticidin resistance, and resistance to zeocin. Other 
selectable markers can be used in different hosts such as yeast (ura3, his3, leu2, trpY). 

The present invention also encompasses HCV variants that have alterations in the 
noncoding regions of the virus. For example, the foreign gene discussed above can also be 
inserted into a noncoding region of the virus, provided the region with the insert continues to 
be sufficiently functional to allow replication. To provide for translation of a foreign gene 
inserted into a noncoding region, the foreign gene must be operatively linked to translational 
start signals, preferably an internal ribosome entry site (IRES) derived from cellular or viral 
mRNAs [Jang et al, Enzyme 44: 292-309 (1991); Macejak and Sarnow, Nature 353: 90-94 
1991); Molla et al, Nature 356:255-257(1992)]. In essence, this strategy creates a second 
cistron in the variant, separate from the polyprotein coding region cistron. A preferred IRES 
is the encephalomyocarditis virus (EMCV) IRES. 

The foreign gene can also be inserted into the 3 1 NTR or the 5' NTR. In the 3' NTR, 
the foreign gene/IRES cassette is preferably inserted into the most 5', variable domain. 
However, insertions are also envisioned for other regions of the 3' NTR, such as at the 
junction of the variable region and the polypyrimidine region, or within the polypyrirnidine 
region. In the 5' NTR, the foreign gene is preferably inserted into the area just adjacent (3' to) 
the internal HCV IRES. In these variants, the foreign gene is engineered to be operably 
linked to the HCV IRES. Where this is the case, it is preferred that the second IRES (e.g., an 
EMCV IRES) is engineered just 5' to the polyprotein coding region, to be operably linked to 
that region. See Example and Lohmann et al., supra. 

Some of the above strategies for functional expression of heterologous genes have 
been previously described. See Bredenbeek and Rice, (1992) supra for review; see, also 
Figure 2, which is also Figure 2 of U.S. Patent Application No. 08/811,566 (Now U.S. Patent 
No. ). 

Additionally, noncoding region alterations such as mutations, deletions or insertions 
that do not encode a foreign protein are within the scope of the invention. For example, 
mutations, deletions of insertions in the variable or polypyrimidine regions of the 3' NTR, 
including deletions of the entire variable region, or in the 5' NTR region, that create or destroy 
restriction sites or make the variant otherwise identifiable can be used advantageously to 
create a "tagged" variant. See, e.g., Example, where a mutation in the variable region of the 3' 
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NTR created an easily identifiable Avail restriction site, and where a deletion in the 
polypyrimidine region created another identifiable variant. 

The polyprotein coding sequence can comprise mutants with desirable functional 
adaptations such as adaptive or attenuated variants. These improved variants can be superior 
5 in any desired characteristic. Nonlirrriting examples of characteristics that can be improved 
by the present methods include more rapid or more accurate replication in vivo or in culture, 
improved transfection efficiency, improved ability to establish subpassaged cell lines, ability 
to infect a host or a host cell line, virulence, and attenuation of disease symptoms. 

Such HCV variants may be adaptive, e.g., by selection for propagation in animals or 

10 in vitro. See, e.g., Example. Alternatively, me variants can be engineered by design to 

comprise the functional adaptation. See, e.g., Example, where a deletion was designed that 
had increased transfection efficiency and ability to be subpassaged to create a stable cell line, 
supporting persistent HCV replication. 

Non-functional HCV clones, e.g., that are incapable of genuine replication, that fail to 

1 5 produce HCV proteins, that do not produce HCV RNA as detected by Northern analysis, or 
that fail to infect susceptible animals or cell lines in vitro, can be corrected using components 
of the variants of the present invention. By comparing a variant of an authentic HCV nucleic 
acid sequence of the invention, with the sequence of the non-functional HCV clone, defects in 
the non-functional clone can be identified and corrected, and the corrected, replicating variant 

20 could have characteristics like the variant, such as an adaptive mutation, etc. All of the 
methods for modifying nucleic acid sequences available to one of skill in the art to effect 
modifications in the non-functional HCV genome, including but not limited to site-directed 
mutagenesis, substitution of the functional sequence from an authentic HCV variant for the 
homologous sequence in the non-functional clone, etc. 

25 Adaptation of HCV for more improved cell culture characteristics. Replication and 

transfection efficiency and stability of virions and replicons that have wild-type polyprotein 
replication in cell culture is inefficient That is, cells transfected with, e.g., RNA transcripts 
of clones of these strains replicate slowly in culture and the transfected cells are difficult to 
maintain. Additionally, transfection efficiency is poor. That is, very few cells that are 

30 transfected with the RNA replicon are able to support HCV replication. See, e.g., Example 1 
and Lohmann et al., supra, where less than 0.01% of Huh-7 cells transfected with RNA 
transcripts of replicons that have a wild-type (genotype 1, subtype lb) nonstructural 
polyprotein coding region grew into colonies on the petri dish where the transfectants were 
plated. Furthermore, a low percentage of colonies that arose from the original plating (<3%) 
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could be subpassaged onto another dish of media to form an isolated stable cell line 
supporting HCV replication. 

"Transfection efficiency" is defined by determining the percent of cells having 
replicating HCV RNA that continue to translate proteins encoded by the transfected nucleic 

5 acids. The easiest way to measure this is by delmmning the percentage of cells that exhibit a 
characteristic conferred by the HCV RNA. See, e.g., Example 1, where replicons comprising 
a neo gene conferred G418 resistance to the transfected cells, and where the cells were G418 
resistant after dividing and forming colonies on the dish where the transfected cells were 
plated. In that example, G418 resistance would not persist sufficiently for colonies to form 

10 unless the HCV RNA was able to replicate and partition into the dividing cells while 

continuing to replicate and translate the neo gene to confer G418 resistance. Transfection 
efficiency is thus replication dependent, in that the transfected HCV must replicate, 
transcribe, and translate the measured characteristic (here, G418 resistance). In the context of 
the neo selectable marker, this method of detennining transfection efficiency is termed 

15 "replication-dependent neomycin resistance". This is the preferred way of measuring 

transfection efficiency because it only measures transcription from HCV that established itself 
sufficiently to replicate and partition into dividing cells to form a colony. 

Another disadvantageous cell culture characteristic of HCV nucleic acid that has 
wild-type nonstructural polyprotein genes is that only a low percentage of colonies that form 

20 after transfection and selection are able to continue to be maintained upon subpassage as 
continuous cell lines harboring replicating RNA. This was <3% in Lohmann et al., as 
discussed supra. 

Disadvantageous characteristics of HCV having wild-type nonstructural polyprotein 
genes can be reduced by utilizing certain adaptive mutations and deletions in the NS5A 

25 coding region or elsewhere as disclosed herein. Preferred mutations comprise alterations in 
the encoded amino acid sequence in a region of the NS5A that is just 5' to the coding region 
of the "interferon sensitivity-detennining region" (ISDR). Specifically, various mutations 
vrithin about 50 nucleotides 5' to the ISDR, more preferably within about 20 nucleotides of 
the ISDR, where the encoded amino acid sequence is altered, have the effect of adapting an 

30 HCV to have higher transfection efficiency and increased ability to withstand subpassage to 
establish a cell line harboring persistent HCV replication. Specific mutations having this 
effect include Ser to He at amino acid 1 179 of SEQ ID NO:3 (subtype lb nonstructural 
polyprotein region), conferred, for example, by the mutation g to t at position 5336 of SEQ ID 
NO:6, embodied in SEQ ID NO:8 (nucleotide[nt]) and SEQ ID NO: 16 (amino acid[aa]); Arg 

35 to Gly at amino acid 1 164 of SEQ ID NO:3, conferred, for example, by the mutation from a to 
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g at position 5289 of SEQ ID NO:6, embodied in SEQ ID NO:9 (nt) and SEQ ID NO:17 (aa); 
Ala to Ser at amino acid 1 174 of SEQ ID NO:3, conferred, for example, by the mutation from 
g to t at position 5320 of SEQ ID NO:6, embodied in SEQ ID NO: 10 (nt) and the NS5A 
amino acid sequence of SEQ ID NO:19; Ser to Cys at amino acid 1 172 of SEQ ID NO:3, 
5 conferred, for example, by the mutation c to g at position 53 15 of SEQ ID NO:6, embodied in 
the NS5 A gene SEQ ID NO: 1 1 and the NS5 A amino acid sequence of SEQ ID NO:20; and 
Ser to Pro at amino acid 1 172 of SEQ ID NO:3, conferred, for example by the mutation t to c 
at position 53 14 of SEQ ID NO:6, embodied in the NS5 A gene SEQ ID NO: 12 and the NS5 A 
amino acid SEQ ED NO:21. The adaptive effect of these mutations is surprising since mis 
10 region of HCV is normally conserved among HCV isolates. Additionally, deletions within 
the ISDR, including deletions of the entire ISDR and various flanking sequences, cause this 
adaptive effect. Among these deletions is the substitution of the ISDR and flanking sequence 
comprising amino acids 1 182 to 1229 of SEQ ID NO:3 with a tyrosine, conferred, for 
example, by the deletion of nt 5345-5485 of SEQ ID NO:6, and embodied in SEQ DO NO:7 
(nt) and the NS5 A amino acid SEQ ID NO: 14. 

HCV variants comprising mutations adaptive to cell culture may also be attenuated, 
that is impaired in its ability to cause disease, establish chronic infections, trigger autoimmune 
responses, and transform cells. 

The present invention also discloses methods for selecting for adaptive HCV variants. 
These methods comprise the use of an HCV virion or preferably a replicon, which further 
comprises a dominant selectable marker such as a neo gene. Cells are transfected with these 
variants. The transfectants are plated into selection media, such as G418 when the neo gene is 
utilized in the variant. Colonies that arise to exhibit resistance to the selectable marker are 
subpassaged into fresh selection media. HCV in colonies that withstand subpassage to 
establish a cell line harboring HCV replication can be isolated and used to transfect additional 
cells. Any of these colonies that show increased transfection efficiency or other desirable 
characteristics, such as the ability to withstand subpassage, are adaptive variants, where the 
adaptive nature of the variant is conferred by at least one mutation or deletion. Selected areas 
of the HCV in these adaptive variants are sequenced. Preferably, at least the NS5A is 
sequenced. More preferably, the entire polyprotein coding region is sequenced. Any 
mutations in these variants can be further evaluated to determine the adaptive nature of the 
mutations. That evaluation preferably involves recreating the mutation in an otherwise wild- 
type coding region and determining if the recreated HCV mutant exhibits the adaptive 
phenotype of the original mutant. 
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Adaptive mutations could also be manifested, but are not restricted to: (i) altering the 
tropism of HCV KNA replication; (ii) altering viral products responsible for deleterious 
effects on host cells; (iii) increasing or decreasing HCV RNA replication efficiency; (iv) 
increasing or decreasing HCV RNA packaging efficiency and/or assembly and release of 
5 HCV particles; (v) altering cell tropism at the level of receptor binding and entry. Thus, the 
engineered dominant selectable marker, whose expression is dependent upon productive HCV 
KNA replication, can be used to select for adaptive mutations in either the HCV replication 
machinery or the trartsfected host cell, or both. In addition, dominant selectable markers can 
' be used to select for mutations in the HCV replication machinery that allow higher levels of 

1 0 RNA replication or particle formation. In one example, engineered HCV derivatives 

expressing a mutant form of DHFR can be used to confer resistance to methotrexate (MTX). 
As a dominant selectable marker, mutant DHFR is inefficient since nearly stoichiometric 
amounts are required for MTX resistance. By successively increasing concentrations of MTX 
in the medium, increased quantities of DHFR will be required for continued survival of cells 

15 harboring the replicating HCV RNA. This selection scheme, or similar ones based on this 
concept, can result in the selection of mutations in the HCV RNA replication machinery 
allowing higher levels of HCV RNA replication and RNA accumulation. Similar selections 
can be applied for mutations allowing production of higher yields of HCV particles in cell 
culture or for mutant HCV particles with altered cell tropism. Such selection schemes involve 

20 harvesting HCV particles from culture supernatants or after cell disruption and selecting for 
MTX-resistant transducing particles by reinfection of naive cells. 

Methods similar to the above can be used to establish adaptive variants with 
variations in characteristics such as the increased or decreased ability to cause infection, the 
ability to cause infection in a host that,wild-type strains are unable to infect, or cells of such a 

25 host. 

The invention also provides host cell lines transfected with any of the HCV DNA (or 
HCV RNA) as set forth above. Examples of host cells include, but are by no means limited 
to, the group consisting of a bacterial cell, a yeast cell, an insect cell, and a mammalian cell. 
Preferably, the host cell is capable of providing for expression of functional HCV RNA 
30 replicase, virions or virus particle proteins. 

In a related aspect, as briefly described above, the invention provides a vector for 
gene therapy or a gene vaccine (also termed herein a genetic vaccine), in which a 
heterologous protein is inserted into the HCV nucleic acid under conditions that permit 
expression of the heterologous protein. These vaccines can be either DNA or RNA. m 
35 particular, the invention provides an infectious hepatitis C virus (HCV) DNA vector 
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comprising from 5' to 3' on the positive-sense DNA, a promoter; an HCV 5'-non-translated 
region (NTR) containing the extreme 5'-terminal sequence GCCAGCC; an HCV polyprotein 
coding region comprising a coding region for a heterologous gene; and a 3' non-translated 
region (NTR). Preferably, the promoter is selected from the group consisting of 
5 bacteriophage T3, T7, and SP6. 

In the embodiments of the invention where the functional HCV nucleic acid is DNA, 
it may further comprise a promoter operatively associated with the 5' NTR. For example, but 
not by way of limitation, the promoter may be selected from the group consisting of 
bacteriophage T7, T3, and SP6. However, any suitable promoter for transcription of HCV 

1 0 genomic RNA corresponding to the HCV DNA can be used, depending on the specific 
transcription system employed. For example, for nuclear transcription (e.g., in an animal 
transgenic for HCV), an endogenous or viral promoter, such as CMV, may be used. 
Additionally, these promoter-driven HCV DNAs can be incorporated into an 
extrachromosomally replicating DNA such as a plasmid or a phage. 

15 Various uses of the invention variants are envisioned herein. Uses relevant to therapy 

and vaccine development include: (i) the generation of defined HCV virus stocks to develop 
in vitro and in vivo assays for virus neutralization, attachment, penetration and entry; (ii) 
structure/function studies on HCV proteins and RNA elements and identification of new 
antiviral targets; (iii) a systematic survey of cell culture systems and conditions to identify 

20 those that support wild-type and variant HCV RNA replication and particle release; (iv) 

production of adaptive HCV variants capable of more efficient replication in cell culture; (v) 
production of HCV variants with altered tissue or species tropism; (vi) establishment of 
alternative animal models for inhibitor evaluation including those supporting HCV variant 
replication; (vii) development of cell-free HCV replication assays; (vtii) production of 

25 immunogenic HCV particles for vaccination; (ix) engineering of attenuated HCV derivatives 
as possible vaccine candidates; (x) engineering of attenuated or defective HCV derivatives for 
expression of heterologous gene products for gene therapy and vaccine applications; (xi) 
utilization of the HCV glycoproteins for targeted delivery of therapeutic agents to the liver or 
other cell types with appropriate receptors. 

30 The invention further provides a method for infecting an animal with HCV variants, 

where the method comprises administering an infectious dose of HCV variant RNA prepared 
by transcription of infectious HCV variant DNA. The invention extends to a non-human 
animal infected with HCV variants or transfected with HCV variant RNA or DNA. Similarly, 
the invention provides a method for propagating infectious HCV variants in vitro comprising 

35 culturing a cell line contacted with an infectious amount of HCV variant RNA prepared by 
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transcription of the infectious HCV DNA, as well as an in vitro cell line infected with HCV 
variants. In a specific embodiment, the cell line is a hepatocyte cell line transfected or 
infected with an HCV variant in which an IRES-antibiotic resistance cassette has been 
engineered to provide for selection. The variant may also comprise the adaptive mutations 
5 described above. 

In accordance with the gene therapy (genetic vaccine) embodiment of the invention, 
also provided is a method for transducing an animal capable of HCV RNA replication with a 
heterologous gene, comprising administering an amount of an HCV variant RNA prepared by 
transcription of the HCV variant DNA vector. 

In another embodiment, the invention provides a method for producing HCV particle 
proteins comprising culturing a host expression cell line transfected with an HCV variant of 
the invention under conditions that permit expression of HCV particle proteins; and isolating 
HCV particle proteins from the cell culture. In a specific embodiment, such an expression 
cell line may be a cell selected from the group consisting of a bacterial cell, a yeast cell, an 
insect cell, and a mammalian cell. 

The invention further provides an HCV virion comprising an HCV variant RNA 
genome. Such virions can be used in an HCV vaccine, preferably after attenuation, e.g., by 
heat or chemical treatment, or through selection of attenuated variants by the methods 
described above. 

The in vivo and in vitro HCV variants of the invention permits controlled screening 
for anti-HCV agents (z.e., drugs for treatment of HCV), as well as for evaluation of drug 
resistance. An in vivo method for screening for agents capable of modulating HCV 
replication may comprise administering a candidate agent to an animal containing an HCV 
variant, and testing for an increase or decrease in a level of HCV variant infection, replication 
or activity compared to a level of HCV variant infection, replication or activity in the animal 
prior to administration of the candidate agent; wherein a decrease in the level of HCV variant 
infection, replication or activity compared to the level of HCV variant infection, replication or 
activity in the animal prior to administration of the candidate agent is indicative of the ability 
of the agent to inhibit HCV variant infection, replication or activity. Testing for the level of 
HCV variant infection or replication can involve measuring the viral titer (e.g., RNA levels) 
in a serum or tissue sample from the animal; testing for the level of HCV variant activity can 
involve measuring liver enzymes. Alternatively, an in vitro method for screening for agents 
capable of modulating HCV replication can comprise contacting a cell line supporting a 
replicating HCV variant with a candidate agent; and thereafter testing for an increase or 
decrease in a level of HCV variant replication or activity compared to a level of HCV variant 
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replication or activity in a control cell line or in the cell line prior to administration of the 
candidate agent, wherein a decrease in the level of HCV variant replication or activity 
compared to the level of HCV variant replication or activity in a control cell line or in the cell 
line prior to administration of the candidate agent is indicative of the ability of the agent to 
5 inhibit HCV variant replication or activity. In a specific embodiment, testing for the level of 
HCV variant replication in vitro may involve measuring the HCV titer, (e.g., RNA levels) in 
the cell culture; testing for the level of HCV activity in vitro may involve measuring HCV 
replication. 

In addition to the specific HCV variant DNA clones and related HCV variant RNAs, 
10 the invention is directed to a method for preparing an HCV variant DNA clone that is capable 

of replication in a host or host cell line, comprising joining from 5' to 3' on the positive-sense 

DNA a promoter; an HCV 5' non-translated region (NTR) an HCV polyprotein coding region; 

and a 3' non-translated region (NTR), where at least one of these regions is not a naturally 

occurring region. Preferably, the promoter is selected from the group consisting of 
15 bacteriophage T7, T3, and SP6. In a specific embodiment, the extreme 5'-tenninal sequence 

is homologous to SEQ ID NO:l, e.g., the 5'-terminal sequence may be selected from the 

group consisting of GCCAGCC; GGCCAGCC; UGCCAGCC; AGCCAGCC; 

AAGCCAGCC; GAGCCAGCC; GUGCCAGCC; and GCGCCAGCC, wherein the sequence 

GCCAGCC is the 5'-terminus of SEQ ID NO:l. 
20 The 3'-NTR poly-U for use in the method of preparing an HCV variant DNA clone 

may include a long poly-U region. Similarly, the 3'-NTR extreme terminus may be RNA 

homologous to a DNA having the sequence 

5'-TGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCC 
GCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCTGATCATGT-3' (SEQ ID 

25 NO:2); in a specific embodiment, the 3'-NTR extreme terminus has the foregoing sequence. 

Components of functional HCV variant DNA clones. Components of the functional 
HCV variant DNA described in this invention can be used to develop cell-free, cell culture, 
and animal-based screening assays for known or newly identified HCV antiviral targets as 
described infra. For each selected target, it is preferred that the HCV variant used has the 

30 wild-type form of the target Examples of known or suspected targets and assays include [see 
Houghton, In "Fields Virology" (B. N. Fields, D. M. Knipe and P. M. Howley, Eds.), Vol. 
pp. 1035-1058. Raven Press, New York (1996); Rice, (1996) supra; Rice et aL, Antiviral 
Therapy 1, Suppl. 4, 11-17 (1997); Shimotohno, Hepatology 21,:887-8 (1995) for reviews], 
but are not limited to, the following: 
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The highly conserved 5' NTR, which contains elements essential for translation of the 
incoming HCV genome RNA, is one target. It is also likely that this sequence, or its 
complement, contains RNA elements important for RNA replication and/or packaging. 
Potential therapeutic strategies include: antisense oligonucleotides (supra); trans-acting 
5 ribozymes (supra); RNA decoys; small molecule compounds interfering with the function of 
this element (these could act by binding to the RNA element itself or to cognate viral or 
cellular factors required for activity). 

Another target is the HCV C (capsid or core) protein, which is highly conserved and 
is associated with the following functions: RNA binding and specific encapsidation of HCV 

10 genome RNA; transcriptional modulation of cellular [Ray et al, Virus Res. 37: 209-220 

(1995)] and other viral [Shih et al, J. Virol. 69: 1160-1 171 (1995); Shih et al, J. Virol. 67: 
5823-5832 (1993)] genes; binding of cellular helicase [You et al., J. Virol 73:2841-2853 
(1999)]; cellular transformation [Ray et al, J. Virol. 70: 4438-4443 (1996a); Ray et al., J. 
Biol. Chem. 272:10983-10986(1997)]; prevention ofapoptosis [Ray et al, Virol 226: 

15 176-182 (1996b)]; modulation of host immune response through binding to members of the 
TNF receptor superfamily [Matsumoto et al, J. Virol. 71: 1301-1309 (1997)]. 

The El, E2, and perhaps the E2-p7 glycoproteins that form the components of the 
virion envelope are targets for potentially neutralizing antibodies. Key steps where 
intervention can be targeted include: signal peptidase mediated cleavage of these precursors 

20 from the polyprotein [Lin et al, (1994a ) supra]; ER assembly of the E1E2 glycoprotein 
complex and association of these proteins with cellular chaperones and folding machinery 
[Dubuisson et al, (1994) supra; Dubuisson and Rice, J. Virol. 70:778-786(1996)]; 
assembly of virus particles including interactions between the nucleocapsid and virion 
envelope; transport and release of virus particles; the association of virus particles with host 

25 components such as VLDL [Hijikata et al, (1993) supra; Thomssen et al, (1992) supra; 
Thomssen et al, Med. Microbiol. Immunol. 182: 329-334 (1993)] which may play a role in 
evasion of immune surveillance or in binding and entry of cells expressing the LDL receptor; 
conserved and variable determinants in the virion which are targets for neutralization by 
antibodies or which bind to antibodies and facilitate immune-enhanced infection of cells via 

30 interaction with cognate Fc receptors; conserved and variable determinants in the virion 
important for receptor binding and entry; virion determinants participating in entry, fusion 
with cellular membranes, and uncoating the incoming viral nucleocapsid. 

The NS2-3 autoprotease, which is required for cleavage at the 2/3 site is a further 

target. 
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The NS3 serine protease and NS4A cofactor which form a complex and mediate four 
cleavages in the HCV polyprotein [see Rice, (1997) supra for review) is yet another suitable 
target. Targets include the serine protease activity itself; the tetrahedral Zn 2+ coordination site 
in the C-terminal domain of the serine protease; the NS3-NS4A cofactor interaction; the 
5 membrane association of NS4A; stabilization of NS3 by NS4A; transforming potential of the 
NS3 protease region [Sakamuro et al, J Virol 69: 3893-6 (1995)]. 

The NS3 RNA-stimulatedNTPase [Suzich et al, (1993) supra], RNAhelicase [Jin 
and Peterson, Arch Biochem Biophys 323: 47-53 (1995); Kim et al, Biochem Biophys. Res. 
Commun. 215: 160-6 (1995)], and RNA binding [Kanai et al, FEBSLett 376: 221-4 (1995)] 
10 activities; the NS4A protein as a component of the RNA replication complex is another 
potential target. 

The NS5 A protein, another replication component, represents another target. This 
protein is phosphorylated predominantly on serine residues [Tanji et al, J. Virol. 69: 
3980-3986 (1995)]. Transcription modulating, cell growth promoting, and apoptosis 

15 inhibiting activities ofNS5A [Ghosh et al., J. Biol. Chem. 275:7184-7188 (2000)] can be 
targeted. Other characteristics of NS5A that could be targets for therapy include the kinase 
responsible for NS5A phosphorylation and its interaction with NS5 A, and the interaction with 
NS5 A and other components of the HCV replication complex. 

The NS5B RNA-dependent RNA polymerase, which is the enzyme responsible for 

20 the actual synthesis of HCV positive and negative-strand RNAs, is another target. Specific 
aspects of its activity include the polymerase activity itself [Behrens et al, EMBO J. 15: 
12-22 (1996)]; interactions of NS5B with other replicase components, including the HCV 
RNAs; steps involved in the initiation of negative- and positive-strand RNA synthesis; 
phosphorylation of NS5B [Hwang et al, Virology 227:438 (1997)]. 

25 Other targets include structural or nonstractural protein functions important for HCV 

RNA replication and/or modulation of host cell function. Possible hydrophobic protein 
components capable of forming channels important for viral entry, egress or modulation of 
host cell gene expression may be targeted. 

The 3' NTR, especially the highly conserved elements (poly (U/UC) tract; 98-base 

30 terminal sequence) can be targeted. Therapeutic approaches parallel those described for the 5' 
NTR, except that this portion of the genome is likely to play a key role in the initiation of 
negative-strand synthesis. It may also be involved in other aspects of HCV RNA replication, 
including translation, RNA stability, or packaging. 

Hie functional HCV variants of the present invention may encode all of the viral 

35 proteins and RNA elements required for RNA packaging. These elements can be targeted for 
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development of antiviral compounds. Electrophoretic mobility shift, UV cross-linking, filter 
binding, and three-hybrid [SenGupta et al., Proc. Natl. Acad. Sci. USA 93: 8496-8501 
(1996)] assays can be used to define the protein and RNA elements important for HCV RNA 
packaging and to establish assays to screen for inhibitors of this process. Such inhibitors 
might include small molecules or RNA decoys produced by selection in vitro [Gold et al, 
(1995) supra}. 

Complex libraries of the variants of the present invention can be prepared using PCR 
shuffling, or by incorporating randomized sequences, such as are generated in "peptide 
display" libraries. Using the "phage method" [Scott and Smith, 1990, Science 249:386-390 
(1990); Cwirla, et al., Proc. Natl Acad. Sci USA., 57:6378-6382 (1990); Devlin et al., 
Science, 249:404-406 (1990)], very large libraries can be constructed (10 6 -10 8 chemical 
entities). Clones from such libraries can be used to generate other variants or chimeras, e.g., 
using various HCV subtypes. Such variants can be generated by methods known in the art, 
without undue experimentation. 

A clone that includes a primer and run-off sequence can be used directly for 
production of functional HCV variant RNA. A large number of vector-host systems known in 
the art may be used. Examples of vectors include, but are not limited to, E. coli, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 derivatives or pUC 
plasmid derivatives, e.g., pGEX vectors, pmal-c, pFLAG, pTET, etc. As is well known, the 
insertion into a cloning vector can, for example, be accomplished by ligating the DNA 
fragment into a cloning vector that has complementary cohesive termini. However, if the 
complementary restriction sites used to fragment the DNA are not present in the cloning 
vector, the ends of the DNA molecules maybe enzymatically modified. Alternatively, any 
site desired could be produced by ligating nucleotide sequences (linkers) onto the DNA 
termini; these ligated linkers may comprise specific chemically synthesized oligonucleotides 
encoding restriction endonuclease recognition sequences. Recombinant molecules can be 
introduced into host cells via transformation, transfection, infection, electroporation, etc., so 
that many copies of the gene sequence are generated. 

Expression of HCV RNA and Polypeptides 
The HCV variant DNA, which codes for HCV variant RNA and HCV proteins, 
particularly HCV RNA replicase or virion proteins, can be inserted into an appropriate 
expression vector, i.e., a vector which contains the necessary elements for the transcription 
and translation of the inserted protein-coding sequence. Such elements are termed herein a 
"promoter." Thus, the HCV variant DNA of the invention is operationally (or operably) 
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associated with a promoter in an expression vector of the invention. An expression vector 
also preferably includes a replication origin. The necessary transcriptional and translational 
signals can be provided on a recombinant expression vector. In a preferred embodiment for in 
vitro synthesis of functional RNAs, the T7, T3, or SP6 promoter is used. 
5 Potential host- vector systems include but are not limited to mammalian cell systems 

infected with virus recombinant (e.g., vaccinia virus, adenovirus, Sindbis virus, SemliM 
Forest virus, etc.); insect cell systems infected with recombinant viruses (e.g., baculovirus); 
microorganisms such as yeast containing yeast vectors; plant cells; or bacteria transformed 
with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression elements of 

10 vectors vary in their strengths and specificities. Depending on the host-vector system utilized, 
any one of a number of suitable transcription and translation elements may be used. 

The cell into which the recombinant vector comprising the HCV variant DNA clone 
has been introduced is cultured in an appropriate cell culture medium under conditions that 
provide for expression of HCV RNA or such HCV proteins by the cell. Any of the methods 

15 previously described for the insertion of DNA fragments into a cloning vector may be used to 
construct expression vectors containing a gene consisting of appropriate 
transcriptional/translational control signals and the protein coding sequences. These methods 
may include in vitro recombinant DNA and synthetic techniques and in vivo recombination 
(genetic recombination). 

20 Expression of HCV variant RNA or protein may be controlled by any 

promoter/enhancer element known in the art, but these regulatory elements must be functional 
in the host selected for expression. Promoters which may be used to control expression 
include, but are not limited to, the SV40 early promoter region (Benoist and Chambon, 1981, 
Nature 290:304-3 10), the promoter contained in the 3' long terminal repeat of Rous sarcoma 

25 virus (Yamamoto, et al, 1980, Cell 22:787-797), the herpes thymidine kinase promoter 

(Wagner et al, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences 
of the metallothionein gene (Brinster et al, 1982, Nature 296:39-42); prokaryotic expression 
vectors such as the P-lactamase promoter (Villa-Kamaroff, et al, 1978, Proc. Natl. Acad. Sci. 
U.S.A. 75:3727-3731), or the tac promoter (DeBoer, et al, 1983, Proc. Natl. Acad. Sci. 

30 U.S.A. 80:21-25); promoter elements from yeast or other fungi such as the Gal 4 promoter, 
the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, 
alkaline phosphatase promoter; and the animal transcriptional control regions, which exhibit 
tissue specificity and have been utilized in transgenic animals: elastase I gene control region 
which is active in pancreatic acinar cells (Swift et al, 1984, Cell 38:639-646; Ornitz et al, 

35 • 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 
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7:425-515); insulin gene control region which is active in pancreatic beta cells (Hanahan, 
1985, Nature 3 15:1 15-122), immunoglobulin gene control region which is active in lymphoid 
cells (Grosschedl et al, 1984, Cell 38:647-658; Adames et al., 1985, Nature 318:533-538; 
Alexander et al, 1987, Mol. Cell. Biol. 7:1436-1444), mouse mammary tumor virus control 

5 region which is active in testicular, breast, lymphoid and mast cells (Leder et al, 1986, Cell 
45:485-495), albumin gene control region which is active in liver (Pinkert et al, 1987, Genes 
and Devel. 1 :268-276), alpha-fetoprotein gene control region which is active in liver 
(Krumlauf et al, 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et al, 1987, Science 235:53- 
58), alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et al, 1987, 

10 Genes and Devel. 1:161-171), beta-globin gene control region which is active in myeloid cells 
(Mogram et al, 1985, Nature 315:338-340; Kollias et al, 1986, Cell 46:89-94), myelin basic 
protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et 
al, 1987, Cell 48:703-712), myosin light chain-2 gene control region which is active in 
skeletal muscle (Sani, 1985, Nature 3 14:283-286), and gonadotropic releasing hormone gene 

15 control region which is active in the hypothalamus (Mason et al, 1986, Science 234: 1372- 
1378). 

A wide variety of hosl/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for example, 
may consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. 

20 Suitable vectors include derivatives of S V40 and known bacterial plasmids, e.g. , E. coli 

plasmids col El, pCRl, pBR322, pMal-C2, pET, pGEX [Smith et al, 1988, Gene 67:31-40], 
pMB9 and their derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous 
derivatives of phage A, e.g., NM989, and other phage DNA e.g., M13 and filamentous single 
stranded phage DNA; yeast plasmids such as the 2|i plasmid or derivatives thereof; vectors 

25 useful in eukaryotic cells, such as vectors useful in insect or mammalian cells; vectors derived 
from combinations of plasmids and phage DNAs, such as plasmids that have been modified to 
employ phage DNA or other expression control sequences; and the like known in the art. 

In addition to the preferred sequencing analysis, expression vectors containing an 
HCV variant DNA clone of the invention can be identified by four general approaches: (a) 

30 PCR amplification of the desired plasmid DNA or specific rnRNA, (b) nucleic acid 

hybridization, (c) presence or absence of selection marker gene functions, (d) analysis with 
appropriate restriction endonucleases and (e) expression of inserted sequences. In the first 
approach, the nucleic acids can be amplified by PCR to provide for detection of the amplified 
product. In the second approach, the presence of nucleic acids in an expression vector can be 

35 detected by nucleic acid hybridization using probes comprising sequences that are 
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homologous to the HCV variant DNA. In the third approach, the recombinant vector/host 
system can be identified and selected based upon the presence or absence of certain "selection 
marker" gene functions (e.g., P-galactosidase activity, thymidine kinase activity, resistance to 
antibiotics, transformation phenotype, occlusion body formation in baculovirus, etc.) caused 
5 by the insertion of foreign genes in the vector. In the fourth approach, recombinant 

expression vectors are identified by digestion with appropriate restriction enzymes. In the 
fifth approach, recombinant expression vectors can be identified by assaying for the activity, 
biochemical, or immunological characteristics of the gene product expressed by the 
recombinant, e.g., HCV RNA, HCV virions, or HCV viral proteins. 

1 0 For example, in a baculovirus expression systems, both non-fusion transfer vectors, 

such as but not limited to pVL941 (BamHL cloning site; Summers), pVL1393 (BamHI, Smal, 
Xbal, EcdRl, Notl, XmaLH, BglH, and PsiL cloning site; Invitrogen), pVL1392 (BglH, PstI, 
NotI, XmaLLL, EcoRI, Xbal, Smal, and BamHL cloning site; Summers and Invitrogen), and 
pBlueBacm (BamBI, BglLl, PstI, Ncol, and HindUL cloning site, with blue/white recombinant 

15 screening possible; Invitrogen), and fusion transfer vectors, such as but not limited to pAc700 
(BamHL and Kpnl cloning site, in which the BamHL recognition site begins with the initiation 
codon; Summers), pAc701 and pAc702 (same as pAc700, with different reading frames), . 
pAc360 (BamHI cloning site 36 base pairs downstream of a polyhedrin initiation codon; 
Invitrogen(195)), andpBlueBacHisA, B, C (three different reading frames, with .BamHI, 

20 BglLL, PstL, Ncol, and HindUL cloning site, an N-terminal peptide for ProBond purification, 
and blue/white recombinant screening of plaques; Invitrogen) can be used. 

Examples of mammalian expression vectors contemplated for use in the invention 
include vectors with inducible promoters, such as the dihydrofolate reductase (DHFR) 
promoter, e.g., any expression vector with a. DHFR expression vector, or a 

25 Di2Fi?/methotrexate co-amplification vector, such as pED (PstL, Sail, Sbal, Smal, and EcdBI 
cloning site, with the vector expressing both the cloned gene and DHFR); [see Kaufman, 
Current Protocols in Molecular Biology, 16.12 (1991)]. Alternatively, a glutamine 
synthetase/methionine sulfoximine co-amplification vector, such as pEE14 (HindLLL, Xbal, 
Smal, Sbal, EcoHI, and BcR cloning site, in which the vector expresses glutamine synthase 

30 and the cloned gene; Celltech). In another embodiment, a vector that directs episomal 

expression under control of Epstein Barr Virus (EBV) can be used, such as pREP4 (BamHL, 
SjiL, Xhol, Notl, NheL, HindUL, Nhel, PvuSL, and KpnL cloning site, constitutive RSV-LTR 
promoter, hygromycin selectable marker; Invitrogen), pCEP4 (BamHI, SfiL, XhoL, Notl, Nhel, 
HindLLL, NheL, PvuH, and Kpnl cloning site, constitutive hCMV immediate early gene, 

35 hygromycin selectable marker; Invitrogen), pMEP4 (KpnL, Pvul, Nhel, HindLU, Notl, Xltol, 
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Sfil, BamHI cloning site, inducible methallothionein Ha gene promoter, hygromycin 
selectable marker: Invitrogen), pREP8 (BamHI, Xhol, Noil, Hindm, Nhel, and Kpnl cloning 
site, RSV-LTR promoter, bistidinol selectable marker; Invitrogen), pREP9 (Kpnl, Nhel, 
Hindm, Noil, Xhol, Sfil, and BamHI cloning site, RSV-LTR promoter, G418 selectable 

5 marker; Invitrogen), and pEBVHis (RSV-LTR promoter, hygromycin selectable marker, N- 
terminal peptide purifiable via ProBond resin and cleaved by enterokinase; Invitrogen). 
Regulatable mammalian expression vectors, can be used, such as Tet and rTet [Gossen and 
Bujard, Proc. Natl. Acad. Sci. USA 89:5547-51 (1992); Gossen et al, Science 268:1766-1769 
(1995)]. Selectable mammalian expression vectors for use in the invention include pRc/CMV 

10 (Hindm, BsiXl, Noil, Sbal, and Apal cloning site, G41 8 selection; Invitrogen), pRc/RSV 
(Hindm, Spel, BsiKL, Noil, Mai cloning site, G418 selection; Invitrogen), and others. 
Vaccinia virus mammalian expression vectors [see, Kaufman (1991) supra] for use according 
to the invention include but are not limited to pSCl 1 (Smal cloning site, IK- and p-gal 
selection), pMJ601 (Sail, Smal, Afll, Narl, BspMSL, BamHI, Apal, Nhel, SacR, Kpnl, and 

15 Hindm cloning site; TK- and p-gal selection), and pTKgptFIS (EcdRL, Pstl, Sail, Accl, 
HindQ., Sbal, BamHL, and Hpa cloning site, TK or XPRT selection). 

Examples of yeast expression systems include the non-fusion pYES2 vector (Xbal, 
Sphl, Shol, Noil, GsiXL, EcdSl, BsiXl, BamHI, Sacl, Kpnl, and Hindm cloning sit; 
Invitrogen) or the fusion pYESHisA, B, C (Xba\ Sphl, Shol, Noil, BsiXl, EcdSI, BamHI, 

20 Sacl, Kpnl, and Hindm cloning site, N-terminal peptide purified with ProBond resin and 

cleaved with enterokinase; Invitrogen), to mention just two, can be employed according to the 
invention. 

In addition, a host cell strain may be chosen that modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion desired. 

25 Different host cells have characteristic and specific mechanisms for the translational and post- 
translational processing and modification (e.g., glycosylation, cleavage [e.g., of signal 
sequence]) of proteins. Expression in yeast can produce a glycosylated product Expression 
in eukaryotic cells can increase the likelihood of "native" glycosylation and folding of an 
HCV protein. Moreover, expression in mammalian cells can provide a tool for reconstituting, 

30 or constituting, native HCV virions or virus particle proteins. 

A variety of transfection methods, useful for other RNA virus studies, can be utilized 
herein without undue experimentation. Examples include microinjection, cell fusion, 
calcium-phosphate cationic liposomes such as lipofectin [Rice et al., New Biol. 1:285-296 
(1989); see "HCV-based Gene Expression Vectors", infra], DE-dextran [Rice et al., J. Virol. 

35 61: 3809-3819 (1987)], and electroporation [Bredenbeek et al, J. Virol. 67: 6439-6446 



WO 01/89364 



PCT/US01/16822 



40 

(1993);Liljestr6me*a/., J.Virol. 65:4107-4113(1991)]. Scrape loading [Kumar etal, 
Biochem. Mol. Biol. Int. 32: 1059-1066 (1994)] and ballistic methods [Burkholder et al, J. 
Immunol. Meth. 165: 149-156 (1993)] may also be considered for cell types refractory to 
transfection by these other methods. A DNA vector transporter may be considered [see, e.g., 
5 Wu et al, 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621- 
14624; Hartmut etal, Canadian Patent Application No. 2,012,311, filed March 15, 1990]. 

In Vitro Transfection With HCV Variants 
Identification of cell lines supporting HCV replication. An important aspect of the 

10 invention is a method it provides for developing new and more effective anti-HCV therapy by 
conferring the ability to evaluate the efficacy of different therapeutic strategies using an 
authentic and standardized in vitro HCV variant replication system. Such assays are 
invaluable before moving on to trials using rare and valuable experimental animals, such as 
the chimpanzee, or HCV-infected human patients. The adaptive variants of the invention are 

15 particularly useful for this work because their growth in culture and their ability to withstand 
subpassage is superior to wild-type strains. Also, the replicons disclosed herein are useful 
because replication can be evaluated without the confounding effects of the structural 
proteins. 

The HCV variant infectious clone technology can also be used to establish in vitro 

20 and in vivo systems for analysis of HCV replication and packaging. These include, but are 
not restricted to, (i) identification or selection of permissive cell types (for RNA replication, 
virion assembly and release); (if) investigation of cell culture parameters (e.g., varying culture 
conditions, cell activation, etc.) or selection of adaptive mutations that increase the efficiency 
of HCV replication in cell cultures; and (iii) definition of conditions for efficient production 

25 of infectious HCV variant particles (either released into the culture supernatant or obtained 
after cell disruption). These and other readily apparent extensions of the invention have broad 
utility for HCV therapeutic, vaccine, and diagnostic development. 

General approaches for identifying permissive cell types are outlined below. Optimal 
methods for RNA transfection (see also, supra) vary with cell type and are determined using 

30 RNA reporter constructs. These include, for example, the bicistronic replicons disclosed 
supra and in the Examples, and bicistronic virus [Wang et al., J. Virol. 67: 3338-44 (1993)] 
with the structure 5'-CAT-HCV IRES-LUC-3'. These HCV variants are used both to 
optimize transfection conditions (using, e.g., by measuring P-galactosidase or CAT 
[chloramphenicol acetyltransferase] activity to determine transfection efficiency) and to 

35 determine if the cell type is permissive for HCV IRES-mediated translation (e.g., by 
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measuring LUC; luciferase activity). For actual HCV RNA transfection experiments, 
cotransfection with a 5' capped luciferase reporter RNA [Wang et al, (1993) supra] provides 
an internal standard for productive transfection and translation. Examples of cell types 
potentially permissive for HCV replication include, but are not restricted to, primary human 
5 cells (e.g., hepatocytes, T-cells, B-cells, foreskin fibroblasts) as well as continuous human cell 
lines (e.g, HepG2, Huh7, HUT78, HPB-Ma, MT-2, MT-2C, and other HTLV-1 and HTLV-H 
infected T-cell lines, Namalawa, Daudi, EBV-transformed LCLs). In addition, cell lines of 
other species, especially those which are readily transfected with RNA and permissive for 
replication of flaviviruses or pestivimses (e.g., SW-13, Vero, BHK-21, COS, PK-15, MBCK, 

10 etc.), can be tested. Cells are transfected using a method as described supra. 

For replication assays, RNA transcripts are prepared using the HCV variant and the 
corresponding non-functional, e.g., AGDD (see Examples) derivative as a negative control, 
for persistence of HCV RNA and antigen in the absence of productive replication. Template 
DNA (which complicates later analyses) is removed by repeated cycles of DNasel treatment 

1 5 and acid phenol extraction followed by purification by either gel electrophoresis or gel 

filtration, to preferably achieve less than one molecule of amplifiable DNA per 10 9 molecules 
of transcript RNA. DNA-free RNA transcripts are mixed with LUC reporter RNA and used 
to transfect cell cultures using optimal conditions determined above. After recovery of the 
cells, RNaseA is added to the media to digest excess input RNA and the cultures incubated 

20 for various periods of time. An early timepoint (~1 day post-transfection) will be harvested 
and analyzed for LUC activity (to verify productive transfection) and positive-strand RNA 
levels in the cells and supernatant (as a baseline). Samples are collected periodically for 2-3 
weeks and assayed for positive-strand RNA levels by QC-RT/PCR [see Kolykhalov et al., 
(1996) supra]. Cell types showing a clear and reproducible difference between the intact 

25 infectious transcript and the non-functional derivative, e.g., AGDD deletion, control can be 
subjected to more thorough analyses to verify authentic replication. Such assays include 
measurement of negative-sense HCV RNA accumulation by QC-RT/PCR [Gunji et al., 
(1994) supra; Lanford et al, Virology 202: 606-14 (1994)], Northern-blot hybridization, or 
metabolic labeling [Yoo et al, (1995) supra] and single cell methods, such as in situ 

30 hybridization [ISH; Gowans et al., In "Nucleic Acid Probes" (R. H. Symons, Eds.), Vol. pp. 
139-158. CRC Press, Boca Raton. (1989)], in situ PCR [followed by ISH to detect only HCV- 
specific amplification products; Haase et al., Proc. Natl. Acad. Sci. USA 87: 4971-4975 
(1990)], and irnmunohistochemistry. 

HCV particles for studying virus-receptor interactions. In combination with the 

35 identification of cell lines that are permissive for HCV replication, defined HCV variant 
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stocks can be used to evaluate the interaction of the HCV with cellular receptors. Assays can 
be set up which measure binding of the virus to susceptible cells or productive infection, and 
then used to screen for inhibitors of these processes. 

Identification of cell lines for characterization of HCV receptors. Cell lines 
5 permissive for HCV RNA replication, as assayed by RNA transfection, can be screened for 
their ability to be infected by the virus using the HCV variants of the present invention. Cell 
lines permissive for RNA replication but which cannot be infected by the homologous virus 
may lack one or more host receptors required for HCV binding and entry. Such cells provide 
valuable tools for (i) functional identification and molecular cloning of HCV receptors and 

10 co-receptors; (ii) characterization of virus-receptor interactions; and (iii) developing assays to 
screen for compounds or biologies (e.g. , antibodies, SELEX RNAs [Bartel and Szostak, In 
"RNA-protein interactions" (K. Nagai and L W. Mattaj, Eds.), Vol. pp. 82-102. IRL Press, 
Oxford (1995); Gold et al, Anna. Rev. Biochem. 64: 763-797 (1995)], etc.) that inhibit these 
interactions. Once defined in this manner, these HCV receptors serve not only as therapeutic 

1 5 targets but may also be expressed in transgenic animals rendering them susceptible to HCV 
infection [Koike et al, Dev Biol Stand 78: 101-7 (1993); Ren and Racaniello, J Virol 66: 
296-304 (1992)]. Such transgenic animal models supporting HCV replication and spread 
have important applications for evaluating anti-HCV drugs. 

The ability to manipulate the HCV glycoprotein structure may also be used to create 

20 HCV variants with altered receptor specificity. In one example, HCV glycoproteins can be 
modified to express a heterologous binding domain for a known cell surface receptor. The 
approach should allow the engineering of HCV derivatives with altered tropism and perhaps 
extend infection to non-chimeric small animal models. 

Alternative approaches for identifying permissive cell lines. As previously discussed, 

25 and as exemplified in the Examples, functional HCV variants can be engineered that comprise 
selectable markers for HCV replication. For instance, genes encoding dominant selectable 
markers can be expressed as part of the HCV polyprotein, or as separate cistrons located in 
permissive regions of the HCV RNA genome. 

30 Animal Models for HCV Infection and Replication 

In addition to chimpanzees, the present invention permits development of alternative 
animal models for studying HCV replication and evaluating novel therapeutics. Using clones 
of the authentic HCV variants described in this invention as starting material, multiple 
approaches can be envisioned for establishing alternative animal models for HCV replication. 

35 In one manifestation, the variants could be used to inoculate immunodeficient mice harboring 
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human tissues capable of supporting HCV replication. An example of this art is the SCID:Hu 
mouse, where mice with a severe combined immunodeficiency are engrafted with various 
human (or chimpanzee) tissues, which could include, but are not limited to, fetal liver, adult 
liver, spleen, or peripheral blood mononuclear cells. Besides SCID mice, normal irradiated 

5 mice can serve as recipients for engraftment of human or chimpanzee tissues. These chimeric 
animals would men be substrates for HCV replication after either ex vivo or in vivo infection 
with defined virus-containing inocula. 

In another manifestation, adaptive mutations allowing HCV replication in alternative 
species may produce variants that are permissive for replication in these animals. For 

10 instance, adaptation of HCV for replication and spread in either continuous rodent cell lines 
or primary tissues (such as hepatocytes) could enable the virus to replicate in small rodent 
models. Alternatively, complex libraries of HCV variants created by DNA shuffling 
[Stemmer, Proc. Natl. Acad. Sci. USA 91:10747 (1994)] or other methods known in the art 
can be created and used for inoculation of potentially susceptible animals. Such animals 

15 could be either immunocompetent or immunodeficient, as described above. 

The functional activity of HCV variants can be evaluated transgenically. In this 
respect, a transgenic mouse model can be used [see, e.g., Wilmut et ah, Experientia 47:905 
(1991)]. The HCV RNA or DNA clone can be used to prepare transgenic vectors, including 
viral vectors, plasmid or cosmid clones (or phage clones). Cosmids may be introduced into 

20 transgenic mice using published procedures [Jaenisch, Science, 240: 1468-1474 (1988)]. In 
the preparation of transgenic mice, embryonic stein cells are obtained from blastocyst 
embryos [Joyner, In Gene Targeting: A Practical Approach. The Practical Approach Series, 
Rickwood, D., and Hames, B. D. 5 Eds., IRL Press: Oxford (1993)] and transfected with HCV 
variant DNA or KNA. Transfected cells are injected into early embryos, e.g., mouse 

25 embryos, as described {Hammer et at, Nature 315:680 (1985); Joyner, supra]. Various 
techniques for preparation of transgenic animals have been described [U.S. Patent No. 
5,530,177, issued June 25, 1996; U.S. Patent No. 5,898,604, issued December 31, 1996]. Of 
particular interest are transgenic animal models in which the phenotypic or pathogenic effects 
of a transgene are studied. For example, the effects of a rat phosphoenolpyruvate 

30 carboxytonase-bovine growth hormone fusion gene has been studied in pigs fWieghart et al, 
J. Reprod. Fert., Suppl. 41:89-96 (1996)]. Transgenic mice that express of a gene encoding a 
human amyloid precursor protein associated with Alzheimer's disease are used to study this 
disease and other disorders [International Patent Publication WO 96/06927, published March 
7, 1996; Quon et al, Nature 352:239 (1991)]. Transgenic mice have also been created for the 

35 hepatitis delta agent [Polo et al, J. Virol. 69:5203 (1995)] and for hepatitis B virus [Chisari, 
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Curr. Top. Microbiol. Immunol. 206:149 (1996)], and replication occurs in these engineered 
animals. 

Thus, the functional HCV variants described here, or parts thereof, can be used to 
create transgenic models relevant to HCV replication and pathogenesis. In one example, 
5 transgenic animals harboring the entire genome of an HCV variant can be created. 

Appropriate constructs for transgenic expression of the entire HCV variant genome in a 
transgenic mouse of the invention could include a nuclear promoter engineered to produce 
transcripts with the appropriate 5' terminus, the full-length HCV variant cDNA sequence, a 
cis-cleaving delta ribozyme (Ball, J. Virol. 66: 2335-2345 (1992); Pattnaik et al., Cell 69: 

10 101 1-1020 (1992)] to produce an authentic 3' terminus, followed possibly by signals that 
promote proper nuclear processing and transport to the cytoplasm (where HCV RNA 
replication occurs). Besides the entire HCV variant genome, animals can be engineered to 
express individual or various combinations of HCV proteins and RNA elements. For 
example, animals engineered to express an HCV gene product or reporter gene under the 

15 control of the HCV IRES can be used to evaluate therapies directed against this specific RNA 
target. Similar animal models can be envisioned for most known HCV targets. 

Such alternative animal models are useful for (i) studying the effects of different 
antiviral agents on replication of HCV variants, including replicons, in a whole animal 
system; (ii) examining potential direct cytotoxic effects of HCV gene products on hepatocytes 

20 and other cell types, defining the underlying mechanisms involved, and identifying and 
testing strategies for therapeutic intervention; and (iii) studying immune-mediated 
mechanisms of cell and tissue damage relevant to HCV pathogenesis and identifying and 
testing strategies for interfering with these processes. 

25 Selection and Analysis of Drug-Resistant Variants 

Cell lines and animal models supporting HCV replication can be used to examine the 
emergence of HCV variants with resistance to existing and novel therapeutics. Like all RNA 
viruses, the HCV replicase is presumed to lack proofreading activity and RNA replication is 
therefore error prone, giving rise to a high level of variation [Bukh et al, (1995) supra]. The 

30 variability manifests itself in the infected patient over time and in the considerable diversity 
observed between different isolates. The emergence of drug-resistant variants is likely to be 
an important consideration in the design and evaluation of HCV mono and combination 
therapies. HCV replication systems of the invention can be used to study the emergence of 
variants under various therapeutic formulations. These might include monotherapy or various 

35 combination therapies (e.g., IFN-a, ribavirin, and new antiviral compounds). Resistant 
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evaluate new therapeutic formulations, or in screening assays for effective anti-HCV drugs 
(infra). 

' 5 Screening For Anti-HCV Agents 

HCV-permissive cell lines or animal models (preferably rodent models) comprising 
adaptive HCV variants can be used to screen for novel inhibitors or to evaluate candidate anti- 
HCV therapies. Such therapies include, but would not be limited to, (i) antisense 
oligonucleotides or ribozymes targeted to conserved HCV RNA targets; (ii) injectable 

1 0 compounds capable of inhibiting HCV replication; and (iii) orally bioavailable compounds 
capable of inhibiting HCV replication. Targets for such formulations include, but are not 
restricted to, (i) conserved HCV RNA elements important for RNA replication and RNA 
packaging; (ii) HCV-encoded enzymes; (iii) protein-protein and protein-RNA interactions 
important for HCV RNA replication, virus assembly, virus release, viral receptor binding, 

15 viral entry, and initiation of viral RNA replication; (iv) virus-host interactions modulating the 
ability of HCV to establish chronic infections; (v) virus-host interactions modulating the 
severity of liver damage, including factors affecting apoptosis andhepatotoxicity; (vi) virus- 
host interactions leading to the development of more severe clinical outcomes including 
cirrhosis and hepatocellular carcinoma; and (vii) virus-host interactions resulting in other, less 

20 frequent, HCV-associated human diseases. 

Evaluation of antisense and ribozyme therapies. The present invention extends to the 
preparation of antisense nucleotides and ribozymes that may be tested for the ability to 
interfere with HCV replication. This approach utilizes antisense nucleic acid and ribozymes 
to block translation of a specific mRNA, either by masking that mRNA with an antisense 

25 nucleic acid or cleaving it with a ribozyme. 

Antisense nucleic acids are DNA or RNA molecules that are complementary to at 
least a portion of a specific mRNA molecule. Reviews of antisense technology include: 
Baertschi, Mol. Cell. Endocrinol. 101:R15-R24 (1994); Crooke et al., Annu. Rev. Pharmacol. 
Toxicol. 36:107-129 (1996); Alama et al., Pharmacol. Res. 36:171-178; and Boyer et al., J. 

30 Hepatol. 32(1 Suppl):98-1 12(2000). The last review discusses antisense technology as it 
applies to HCV. 

In the cell, they hybridize to that mRNA, forrning a double stranded DNA:RNA or 
RNA:RNA molecule. The cell does not translate an mRNA in this double-stranded form. 
Therefore, antisense nucleic acids interfere with the expression of mRNA into protein. 
3 5 Oligomers of about fifteen nucleotides and molecules that hybridize to the AUG initiation 
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codon will be particularly efficient, since they are easy to synthesize and are likely to pose 
fewer problems than larger molecules when introducing them into organ cells. Antisense 
methods have been used to inhibit the expression of many genes in vitro. Preferably synthetic 
antisense nucleotides contain phosphoester analogs, such as phosphorothiolates, or thioesters, 

5 rather than natural phophoester bonds. Such phosphoester bond analogs are more resistant to 
degradation, increasing the stability, and therefore the efficacy, of the antisense nucleic acids. 

In the genetic antisense approach, expression of the wild-type allele is suppressed 
because of expression of antisense RNA. This technique has been used to inhibit TK 
synthesis in tissue culture and to produce phenotypes of the Kruppel mutation in Drosophila, 

10 and the Shiverer mutation in mice [Izant et al, Cell, 36:1007-1015 (1984); Green et al, Annu. 
Rev. Biochem., 55:569-597 (1986); KatsuM et al, Science, 241:593-595 (1988)]. An 
important advantage of this approach is that only a small portion of the gene need be 
expressed for effective inhibition of expression of the entire cognate mRNA. The antisense 
transgene will be placed under control of its own promoter or another promoter expressed in 

15 the correct cell type, and placed upstream of the SV40 polyA site. 

Ribozymes are RNA molecules possessing the ability to specifically cleave other 
single stranded RNA molecules in a manner somewhat analogous to DNA restriction 
endonucleases. Ribozymes were discovered from the observation that certain mRNAs have 
the ability to excise their own introns. By modifying the nucleotide sequence of these RNAs, 

20 researchers have been able to engineer molecules that recognize specific nucleotide sequences 
in an RNA molecule and cleave it. Recent reviews include Shippy et al., Mol. Biotechnol. 
12:1 17-129 (1999); Schmidt, Mol. Cells 9:459-463 (1999); Phylactou et al., Meth. Enzymol 
313:485-506 (2000); Oketani et al, J. Hepatol. 31:628-634 (1999); Macejak et al., 
Hepatology 31:769-776 (2000). The last two references disclose the use of ribozymes for 

25 inhibiting HCV. Because they are sequence-specific, only mRNAs with particular sequences 
are inactivated. 

Investigators have identified two types of ribozymes, Tetrahymena-type and 
"hammerhead M -type. Tetrahymena-type ribozymes recognize four-base sequences, while 
"hammerhead"-type recognize eleven- to eighteen-base sequences. The longer the 
30 recognition sequence, the more likely it is to occur exclusively in the target mRNA species. 
Therefore, hammerhead-type ribozymes are preferable to Tetrahymena-type ribozymes for 
inactivating a specific mRNA species, and eighteen base recognition sequences are preferable 
to shorter recognition sequences. 

Screening compound libraries for anti-HCV activity. Various natural product or 
35 synthetic libraries can be screened for anti-HCV activity in the in vitro or in vivo models 
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comprising HCV variants as provided by the invention. One approach to preparation of a 
combinatorial library uses primarily chemical methods, of which the Geysen method [Geysen 
et al., Molecular Immunology 23:709-715 (1986); Geysen et al.J. Immunologic Method 
102:259-274 (1987)] and the method of Fodor et al.[Science 251:767-773 (1991)] are 
5 examples. Furka et al.[14th International Congress of Biochemistry, Volume 5, Abstract 
FR:013 (1988); Furka, Int. J. Peptide Protein Res. 37:487-493 (1991)], Houghton [U.S. 
Patent No. 4,631,211, issued December 1986] andRuttere<a/.[U.S.PatentNo. 5,010,175, 
issued April 23, 1991] describe methods to produce a mixture of peptides that can be tested 
for anti-HCV activity. 

10 In another aspect, synthetic libraries [Needels et al, Proc. Natl. Acad. Sci. USA 

90:10700-4 (1993); Ohlmeyer et al, Proc. Natl Acad. Sci. USA 90:10922-10926 (1993); Lam 
et al, International Patent Publication No. WO 92/00252; Kocis et al, International Patent 
Publication No. WO 9428028], and the like can be used to screen for anti-HCV compounds 
according to the present invention. The references describe adaption of the library screening 

1 5 techniques in biological assays. 

Defined/engineered HCV variant virus particles for neutralization assays. The 
variants described herein can be used to produce defined stocks of HCV particles for 
infectivity and neutralization assays. Homogeneous stocks can be produced in the 
chimpanzee model, in cell culture systems, or using various heterologous expression systems 

20 (e.g. , baculovirus, yeast, mammalian cells; see supra). These stocks can be used in cell 
culture or in vivo assays to define molecules or gene therapy approaches capable of 
neutralizing HCV particle production or infectivity. Examples of such molecules include, but 
are not restricted to, polyclonal antibodies, monoclonal antibodies, artificial antibodies with 
engmeeredVoptimized specificity, single-chain antibodies (see the section on antibodies, 

25 infra), nucleic acids or derivatized nucleic acids selected for specific binding and 

neutralization, small orally bioavailable compounds, etc. Such neutralizing agents, targeted to 
conserved viral or cellular targets, can be either genotype or isolate-specific or broadly cross- 
reactive. They could be used either prophylactically or for passive immunotherapy to reduce 
viral load and perhaps increase the chances of more effective treatment in combination with 

30 other antiviral agents (e.g., IFN-a, ribavirin, etc.). Directed manipulation of HCV infectious 
clones can also be used to produce HCV stocks with defined changes in the glycoprotein 
hypervariable regions or in other epitopes to study mechanisms of antibody neutralization, 
CTL recognition, immune escape and immune enhancement. These studies will lead to 
identification of other virus-specific functions for anti-viral therapy. 

35 
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Dissection of HCV Replication 

Other HCV replication assays. This invention allows directed molecular genetic 
dissection of HCV replication. Such analyses are expected to (i) validate antiviral targets 
which are currently being pursued; and (ii) uncover unexpected new aspects of HCV 
replication amenable to therapeutic intervention. Targets for immediate validation through 
mutagenesis studies include the following: the 5' NTR, the HCV polyprotein and cleavage 
products, and the 3' NTR. As described above, analyses using the HCV variants and 
permissive cell cultures can be used to compare parental and mutant replication phenotypes 
after transfection of cell cultures with infectious RNA. Even though RT-PCR allows 
sensitive detection of viral RNA accumulation, mutations which decrease the efficiency of 
RNA replication may be difficult to analyze, unless conditional mutations are recovered. As a 
complement to first cycle analyses, fraws-complementation assays can be used to facilitate 
analysis of HCV mutant phenotypes and inhibitor screening. Chimeric variants comprising 
portions of heterologous systems (vaccinia, Sindbis, or non-viral) can be used to drive 
expression of the HCV RNA replicase proteins and/or packaging machinery [see Lemm and 
Rice, J. Virol. 67: 1905-1915 (1993a); Lemm and Rice, J. Virol. 67: 1916-1926 (1993b); 
Jjemmetal.EMBOJ. 13: 2925-2934 (1994); Li et al, J. Virol. 65:6714-6723(1991)]. If 
these elements are capable of functioning in trans, then co-expression of RNAs with 
appropriate cw-elements should result in RNA replication/packaging. Such systems therefore 
mimic steps in authentic RNA replication and virion assembly, but uncouple production of 
viral components from HCV replication. If HCV replication is somehow self-limiting, 
heterologous systems may drive significantly higher levels of RNA replication or particle 
production, facilitating analysis of mutant phenotypes and antiviral screening. A third 
approach is to devise cell-free systems for HCV template-dependent RNA replication. A 
coupled translation/replication and assembly system has been described for poliovirus in 
HeLa cells [Barton and Flanegan, J. Virol. 67: 822-831 (1993); Molla et al, Science 254: 
1647-1651 (1991)], and a template-dependent in vitro assay for initiation of negative-strand 
synthesis has been established for Sindbis virus. Similar in vitro systems using HCV variants 
are invaluable for studying many aspects of HCV replication as well as for inhibitor screening 
and evaluation. An example of each of these strategies follows. 

Trans-complementation of HCV RNA replication and/or packaging using viral or 
non-viral expression systems. Heterologous systems can be used to drive HCV replication. 
For example, the vaccinia/T7 cytoplasmic expression system has been extremely useful for 
trans-complementation of RNA virus replicase and packaging functions [see Ball, (1992) 
supra; Lemm and Rice, (1993a) supra; Lemm and Rice, (1993b) supra; Lemm et al, (1994) 
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supra; Pattnaik et al., (1992) supra; Pattnaik et at, Virology 206: 760-4 (1995); Porter et al, 
J. Virol. 69: 1548-1555(1995)]. In brief, a vaccinia recombinant (vTF7-3) is used to express 
T7 RNA polymerase (T7RNApol) in the cell type of interest. Target cDNAs, positioned 
downstream from the T7 promoter, are delivered either as vaccinia recombinants or by 
5 plasmid transfection. This system leads to high level RNA and protein expression. A 

variation of this approach, which obviates the need for vaccinia (which could interfere with 
HCV RNA replication or virion formation), is the pT7T7 system where the T7 promoter 
drives expression of T7RNApol [Chen et al., Nucleic Acids Res. 22: 21 14-2120. (1994)]. 
pT7T7 is mixed with T7RNApol (the protein) and co-transfected with the T7-driven target 
10 plasmid of interest. Added T7RNApol initiates transcription, leading to it own production 
and high level expression of the target gene. Using either approach, RNA transcripts of 
variants with precise 5' and 3' termini can be produced using the T7 transcription start site (5') 
and the cis-cleaving HCV ribozyme (Rz) (3') {Ball, (1992) supra; Pattnaik et al, (1992) 
supra]. 

15 These or similar expression systems can be used to establish assays for HCV RNA 

replication and particle formation using HCV variants, and for evaluation of compounds 
which might inhibit these processes. T7-driven protein expression constructs and full-length 
HCV variants incorporating the HCV ribozyme following the 3 ' NTR can also be used. A 
typical experimental plan to validate the assay as described for pT7T7, although essentially 

20 similar assays can be envisioned using vTF7-3 or cell lines expressing the T7 RNA 
polymerase. HCV-permissive cells are co-transfected with 

pT7T7+T7RNApol+p90/HCVFLlong pU Rz (or a negative control, such as AGDD). At 
different times post-transfection, accumulation of HCV proteins and RNAs, driven by the 
pT7T7 system, are followed by Western and Northern blotting, respectively. To assay for 

25 HCV-specific replicase function, actinomycin D is added to block DNA-dependent T7 

transcription [Lemm and Rice, (1993a), supra] and actinomycin D-resistant RNA synthesis is 
monitored by metabolic labeling. Radioactivity will be incorporated into full-length HCV 
RNAs for p90/HCVFL long pU/Rz, but not for p90/HCVFLAGDD/Rz. Using HCV variants 
of the invention, this assay system, or elaborated derivatives, can be used to screen for 

30 inhibitors and to study their effects on HCV RNA replication. 

Cell-free systems for assaying HCV replication and inhibitors thereof. Cell-free 
assays for studying HCV RNA replication and inhibitor screening can also be established 
using the variants described in this invention. Either virion or transcribed RNAs are used as 
substrate RNA. For HCV, full-length HCV variant RNAs transcribed in vitro can be used to 

35 program such in vitro systems and replication assayed essentially as described for poliovirus 
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[see Barton et al, (1995) supra]. In case hepatocyte-speciflc or other factors are required for 
HCV variant KNA replication, the system can be supplemented with hepatocyte or other cell 
extracts, or alternatively, a comparable system can be established using cell lines which have 
been shown to be permissive for replication of the HCV variants. 
5 One concern about this approach is that proper cell-free synthesis and processing of 

the HCV polyprotein must occur. Sufficient quantities of properly processed replicase 
components may be difficult to produce. To circumvent this problem, the T7 expression 
system can be used to express high levels of HCV replicase components in appropriate cells 
[see Lemm et al, (1997) supra], P15 membrane fractions from these cells (with added 

10 buffer, Mg 2+ , an ATP regenerating system, and NTPs) should be able to initiate and 

synthesize full-length negative-strand RNAs upon addition of HCV-specific template RNAs. 

Establishment of either or both of the above assays allows rapid and precise analysis 
of the effects of HCV mutations, host factors, involved in replication and inhibitors of the 
various steps in HCV RNA replication. These systems will also establish the requirements 

15 for helper systems for preparing replication-deficient HCV vectors. 

Vaccination and Protective Immunity 
There are still many unknown parameters that impact on development of effective HCV 
vaccines. It is clear in both man and the chimpanzee that some individuals can clear the 

20 infection. Also, 10-20% of those treated with IFN or about twice this percentage treated with 
IFN and ribavirin show a sustained response as evidenced by lack of circulating HCV RNA 
Other studies have shown a lack of protective immunity, as evidenced by successful 
reinfection with both homologous virus as well as with more distantly related HCV types 
[Farci et al, (1992) supra; Prince et al, (1992) supra]. Nonetheless, chimpanzees immunized 

25 with subunit vaccines consisting of E1E2 oligomers and vaccinia recombinants expressing 
these proteins are partially protected against low dose challenges [Choo et al, Proc. Natl 
Acad. Set USA 91:1294 (1994)]. The HCV variant technology described in this invention has 
utility not only for basic studies aimed at understanding the nature of protective immune 
responses against HCV, but also for novel vaccine production methods. 

30 Active immunity against HCV can be induced by immunization (vaccination) with an 

immunogenic amount of an attenuated or inactivated HCV variant virion, or HCV virus 
particle proteins, preferably with an immunologically effective adjuvant. An 
"immunologically effective adjuvant" is a material that enhances the immune response. 
Selection of an adjuvant depends on the subject to be vaccinated. Preferably, a 

35 pharmaceutically acceptable adjuvant is used. For example, a vaccine for a human should 
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avoid oil or hydrocarbon emulsion adjuvants, including complete and incomplete Freund's 
adjuvant. One example of an adjuvant suitable for use with humans is alum (alumina gel). A 
vaccine for an animal, however, may contain adjuvants not appropriate for use with humans. 
An alternative to a traditional vaccine comprising an antigen and an adjuvant involves 
5 the direct in vivo introduction of DNA or RNA encoding the antigen into tissues of a subject 
for expression of the antigen by the cells of the subject's tissue. Such vaccines are termed 
herein genetic vaccines, DNA vaccines, genetic vaccination, or nucleic acid-based vaccines. 
Methods of transfection as described above, such as DNA vectors or vector transporters, can 
be used for DNA vaccines. 

10 DNA vaccines are described, e.g., in International Patent Publication WO 95/20660 

and International Patent Publication WO 93/19183, the disclosures of which are hereby 
incorporated by reference in their entireties. The ability of directly injected DNA that 
encodes a viral protein or genome to elicit a protective immune response has been 
demonstrated in numerous experimental systems [Corny et al, Cancer Res., 54:1164-1168 

15 (1994); Cox et al, Virol, 67:5664-5667 ^ (1993); Davis et al, Hum. Mole. Genet., 2:1847-1851 
(1993); Sedegah et al, Proc. Natl. Acad. Set, 91:9866-9870 (1994); Montgomery et al, DNA 
Cell Bio., 12:777-783 (1993); Ulmer et al, Science, 259:1745-1749 (1993); Wang et al, 
Proc. Natl Acad. Set, 90:4156-4160 (1993); Xiang et al, Virology, 199:132-140 (1994)]. 
Studies to assess this strategy in neutralization of influenza virus have used both envelope and 

20 internal viral proteins to induce the production of antibodies, but in particular have focused on 
the viral hemagglutinin protein (HA) [Tynan et al, DNA Cell. Biol, 12:785-789 (1993A); 
Fynan et al, Proc. Natl Acad. Set, 90:11478-11482 (1993B); Robinson et al, Vaccine, 
11:957, (1993); Webster et al, Vaccine, 12:1495-1498 (1994)]. 

Vaccination through directly injecting DNA or RNA that encodes a protein to elicit a 

25 protective immune response produces both cell-mediated and humoral responses. This is 
analogous to results obtained with live viruses [Raz et al, Proc. Natl. Acad. Set, 91:9519- 
9523 (1994); Ulmer, 1993, supra; Wang, 1993, supra; Xiang, 1994, supra]. Studies with 
ferrets indicate that DNA vaccines against conserved internal viral proteins of influenza, 
together with surface glycoproteins, are more effective against antigenic variants of influenza 

30 virus than are either inactivated or subvirion vaccines [Donnelly et al, NatMedicine, 6:583- 
587 (1995)]. Indeed, reproducible immune responses to DNA encoding nucleoprotein have 
been reported in mice that last essentially for the lifetime of the animal [Yankauckas et al, 
DNA Cell Biol, 12: 771-776 (1993)]. 

A vaccine of the invention can be administered via any parenteral route, including but 

35 not limited to intramuscular, intraperitoneal, intravenous, intraarterial (e.g. , Ripatic artery) 
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and the like. Preferably, since the desired result of vaccination is to elucidate an immune 
response to HCV, administration directly, or by targeting or choice of a viral vector, 
indirectly, to lymphoid tissues, e.g., lymph nodes or spleen. Since immune cells are 
continually replicating, they are ideal target for retroviral vector-based nucleic acid vaccines, 
5 since retroviruses require replicating cells. 

Passive immunity can be conferred to an animal subject suspected of suffering an 
infection with HCV by administering antiserum, neutralizing polyclonal antibodies, or a 
neutralizing monoclonal antibody against HCV to the patient. Although passive immunity 
does not confer long-term protection, it can be a valuable tool for the treatment of an acute 

10 infection of a subject who has not been vaccinated. Preferably, the antibodies administered 
for passive immune therapy are autologous antibodies. For example, if the subject is a 
human, preferably the antibodies are of human origin or have been "humanized," in order to 
minimize the possibility of an immune response against the antibodies. In addition, genes 
encoding neutralizing antibodies can be introduced in vectors for expression in vivo, e.g., in 

15 hepatocytes. 

Antibodies for passive immune therapy. Preferably, HCV variant virions or virus 
particle proteins prepared as described above are used as an immunogen to generate 
antibodies that recognize HCV. The variants utilized should have wild-type coat Such 
antibodies include but are not limited to polyclonal, monoclonal, chimeric, single chain, Fab 

20 fragments, and an Fab expression library. Various procedures known in the art may be used 
for the production of polyclonal antibodies to HCV. For the production of antibody, various 
host animals can be immunized by injection with the HCV virions or polypeptide, e.g., as 
describe infra, including but not limited to rabbits, mice, rats, sheep, goats, etc. Various 
adjuvants may be used to increase the immunological response, depending on the host 

25 species, including but not limited to Freund's (complete and incomplete), mineral gels such as 
aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and 
potentially useful human adjuvants such as BCG (pacille Calmette-Guerin) and 
Corynebacterium parvum. 

30 For preparation of monoclonal antibodies directed toward HCV as described above, 

any technique that provides for the production of antibody molecules by continuous cell lines 
in culture may be used. These include but are not limited to the hybridoma technique 
originally developed by Kohler and Milstein [Nature 256:495-497 (1975)], as well as the 
trioma technique, the human B-cell hybridoma technique [Kozbor et al, Immunology Today 

35 4:72 1983); Cote et al, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030 (1983)], and me EBV- 
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hybridoma technique to produce human monoclonal antibodies [Cole et al, in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)]. In an additional 
embodiment of the invention, monoclonal antibodies can be produced in germ-free animals 
[International Patent Publication No. WO 89/12690, published 28 December 1989]. In fact, 
5 according to the invention, techniques developed for the production of "chimeric antibodies" 
[Morrison et al, J. Bacteriol. 159:870 (1984); Neuberger et at, Nature 312:604-608 (1984); 
Takeda et al, Nature 314:452-454 (1985)] by splicing the genes from a mouse antibody 
molecule specific for HCV together with genes from a human antibody molecule of 
appropriate biological activity can be used; such antibodies are within the scope of this 

1 0 invention. Such human or humanized chimeric antibodies are preferred for use in therapy of 
human diseases or disorders (described infra), since the human or humanized antibodies are 
much less likely than xenogenic antibodies to induce an immune response, in particular an 
allergic response, themselves. 

According to the invention, techniques described for the production of single chain 

15 antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. Patent 4,946,778] can 
be adapted to produce HCV-specific single chain antibodies. An additional embodiment of 
the invention utilizes the techniques described for the construction of Fab expression libraries 
[Huse et al, Science 246:1275-1281 (1989)] to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

20 Antibody fragments containing the idiotype of the antibody molecule can be 

generated by known techniques. For example, such fragments include but are not limited to: 
the F(ab')2 fragment which can be produced by pepsin digestion of the antibody molecule; the 
Fab' fragments which can be generated by reducing the disulfide bridges of the F(ab') 2 
fragment, and the Fab fragments which can be generated by treating the antibody molecule 

25 with papain and a reducing agent. 

HCV particles for subunit vaccination. The functional HCV variants of the present 
invention can be used to produce HCV-like particles for vaccination. Proper glycosylation, 
folding, and assembly of HCV particles may be important for producing appropriately 
antigenic and protective subunit vaccines. Several methods can be used for particle 

30 production. They include engineering of stable cell lines for inducible or constitutive 

expression of HCV-like particles (using bacterial, yeast or mammalian cells), or the use of 
higher level eukaryotic heterologous expression systems such as recombinant baculoviruses, 
vaccinia viruses [Moss, Proc. Natl Acad. Set U.S.A. 93: 1 1341-1 1348 (1996)], or 
alphaviruses [Frolov et al, (1996) supra]. HCV particles for immunization may be purified 

35 from either the media or disrupted cells, depending upon their localization. Such purified 
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HCV particles or mixtures of particles representing a spectrum of HCV genotypes, can be 
injected with our without various adjuvants to enhance immunogenicity. 

Infectious non-replicating HCV particles. In another manifestation, particles of HCV 
variants capable of receptor binding, entry, and translation of genome RNA can be produced. 
5 Heterologous expression approaches for production of such particles include, but are not 
restricted to, E. coli, yeast, or mammalian cell lines, appropriate host cells infected or 
harboring recombinant baculoviruses, recombinant vaccinia viruses, recombinant 
alphaviruses or RNA replicons, or recombinant adenoviruses, engineered to express 
appropriate HCV RNAs and proteins. In one example, two recombinant baculoviruses are 
10 engineered. One baculovirus expresses the HCV structural proteins (e.g. C-El-E2-p7) 
required for assembly of HCV particles. A second recombinant expresses the entire HCV 
genome RNA with precise 5' and 3' ends, except that a deletion, such as AGDD or 
GDD— >AAG (see example 1), is included to inactivate the HCV NS5B RDRP. Other 
mutations abolishing productive HCV replication could also be utilized instead or in 
15 combination. Cotransfection of appropriate host cells (Sf9, Sf21, etc.) with both 

recombinants will produce high levels of HCV structural proteins and genome RNA for 
packaging into HCV-like particles. Such particles can be produced at high levels, purified, 
and used for vaccination. Once introduced into the vaccinee, such particles will exhibit 
normal receptor binding and infection of HCV-susceptible cells. Entry will occur and the 
20 genome RNA will be translated to produce all of the normal HCV antigens, except that 
further replication of the genome will be completely blocked given the inactivated NS5B 
polymerase. Such particles are expected to elicit effective CTL responses against structural 
and nonstructural HCV protein antigens. This vaccination strategy alone or preferably in 
conjunction with the subunit strategy described above can be used to elicit high levels of both 
25 neutralizing antibodies and CTL responses to help clear the virus. A variety of different HCV 
genome RNA sequences can be utilized to ensure broadly cross-reactive and protective 
immune responses. In addition, modification of the HCV particles, either through genetic 
engineering, or by derivatization in vitro, could be used to target infection to cells most 
effective at eliciting protective and long lasting immune responses. 
30 Live-attenuated HCV derivatives. The ability to manipulate the HCV genome RNA 

sequence and thereby produce mutants with altered pathogenicity provides a means of 
constructing live-attenuated HCV variants appropriate for vaccination. Such vaccine 
candidates express protective antigens but would be impaired in their ability to cause disease, 
establish chronic infections, trigger autoimmune responses, and transform cells. 
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Additionally, viruses propagated in cell culture frequently acquire mutations in their 
RNA genomes that display attenuated phenotypes in vivo, while still retaining their 
immunogenicity. Attenuated virus strains would be impaired in their ability to cause disease 
and establish chronic infections. Production of HCV variants adapted for tissue culture may 
5 represent potential candidates for live-attenuated vaccines. An attractive possibility is the 
production of HCV derivatives containing the deletion in NS5A described in this application 
as clone I (see Example 1). Such a variant is less likely to revert to wild type in the host. 

HCV Variant-based Gene Expression Vectors 

10 Some of the same properties of HCV leading to chronic liver infection of humans may also be 
of great utility for designing vectors for gene expression in cell culture systems, genetic 
vaccination, and gene therapy. The HCV variants described herein can be engineered to 
produce chimeric RNAs designed for the expression of heterologous gene products (RNAs 
and proteins). Strategies have been described above and elsewhere [Bredenbeek and Rice, 

15 (1992) supra; Frolov et at, (1996) supra] and include, but are not limited to (i) in-frame 
fusion of the heterologous coding sequences with the HCV polyprotein; (ii) creation of 
additional cistrons in the HCV genome RNA; and (iii) inclusion of IRES elements to create 
multicistronic self-replicating HCV vector RNAs capable of expressing one or more 
heterologous genes (Figure 2). Functional HCV RNA backbones utilized for such vectors 

20 include, but are not limited to, (i) live-attenuated derivatives capable of replication and 
spread; (ii) RNA replication competent "dead end" derivatives lacking one or more viral 
components (e.g. the structural proteins) required for viral spread; (iii) mutant derivatives 
capable of high and low levels of HCV-specific RNA synthesis and accumulation; (iv) mutant 
derivatives adapted for replication in different human cell types; (v) engineered or selected 

25 mutant derivatives capable of prolonged noncytopathic replication in human cells. Vectors 
competent for RNA replication but not packaging or spread can be introduced either as naked 
RNA DNA, or packaged into virus-like particles. Such virus-like particles can be produced 
as described above and composed of either unmodified or altered HCV virion components 
designed for targeted transfection of the hepatocytes or other human cell types. Alternatively, 

30 HCV RNA vectors can be encapsidated and delivered using heterologous viral packaging 
machineries or encapsulated into liposomes modified for efficient gene delivery. These 
packaging strategies, and modifications thereof, can be utilized to efficiently target HCV 
vector RNAs to specific cell types. Using methods detailed above, similar HCV-derived 
vector systems, competent for replication and expression in other species, can also be derived. 
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Various methods, e.g., as set forth supra in connection with transfection of cells and 
DNA vaccines, can be used to introduce an HCV vector of the invention. Of primary interest 
is direct injection of functional HCV RNA or virions, e.g., in the liver. Targeted gene 
delivery is described in Ihtemational Patent Publication WO 95/28494, published October 
5 1995 . Alternatively, the vector can be introduced in vivo by lipofection. For the past decade, 
there has been increasing use of liposomes for encapsulation and transfection of nucleic acids . 
in vitro. Synthetic cationic lipids designed to limit the difficulties and dangers encountered 
with liposome mediated transfection can be used to prepare liposomes for in vivo transfection 
of a gene encoding a marker [Feigner, et. al., Proc. Natl. Acad. Sci. U.S.A. 84:7413-7417 

10 (1987); see Mackey, et al, Proc. Natl. Acad. Sci. U.S.A. 85:8027-803 1 (1988); Ulmer et al, 
Science 259:1745-1748 (1993)]. The use of cationic lipids may promote encapsulation of 
negatively charged nucleic acids, and also promote fusion with negatively charged cell 
membranes [Feigner and Ringold, Science 337:387-388 (1989)]. The use of lipofection to 
introduce exogenous genes into the specific organs in vivo has certain practical advantages. 

15 Molecular targeting of liposomes to specific cells represents one area of benefit. It is clear 
that directing transfection to particular cell types would be particularly advantageous in a 
tissue with cellular heterogeneity, such as pancreas, liver, kidney, and the brain. Lipids may 
be chemically coupled to other molecules for the purpose of targeting [see Mackey, et. al., 
supra]. Targeted peptides, e.g., hormones or neurotransmitters, and proteins such as 

20 antibodies, or non-peptide molecules could be coupled to liposomes chemically. Receptor- 
mediated DNA delivery approaches can also be used [Curiel et al., Hum. Gene Ther. 3: 147- 
154 (1992); Wu and Wu, J. Biol. Chem. 262:4429-4432 (1987)]. 

Examples of applications for gene therapy include, but are not limited to, (i) 
expression of enzymes or other molecules to correct inherited or acquired metabolic defects; 

25 (ii) expression of molecules to promote wound healing; (iii) expression of immunomodulatory 
molecules to promote immune-mediated regression or elimination of human cancers; (iv) 
targeted expression of toxic molecules or enzymes capable of activating cytotoxic drugs in 
tumors; (v) targeted expression of anti-viral or anti-microbial agents in pathogen-infected 
cells. Various therapeutic heterologous genes can be inserted in a gene therapy vector of the 

30 invention, such as but not limited to adenosine deaminase (ADA) to treat severe combined 
immunodeficiency (SCTD); marker genes or lymphokine genes into tumor infiltrating (TIL) T 
cells [Kasis et al, Proc. Natl. Acad. Sci. U.SA. 87:473 (1990); Culver et al, ibid. 88:3 155 
(1991)]; genes for clotting factors such as Factor VDI and Factor K for treating hemophilia 
[Dwarki et al.Proc. Natl. Acad. Sci. USA, 92:1023-1027 (19950); Thompson, Thromb. and 

35 Haemostatis, 66:1 19-122 (1991)]; and various other well known therapeutic genes such as, 
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but not limited to, P-globin, dystrophin, insulin, erythropoietin, growth hormone, 
glucocerebrosidase, p-glucuronidase, a-antitrypsin, phenylalanine hydroxylase, tyrosine 
hydroxylase, ornithine transcarbamylase, apolipoproteins, and the like. In general, see U.S. 
Patent No. 5,399,346 to Anderson et al 

5 Examples of applications for genetic vaccination (for protection from pathogens other 

than HCV) include, but are not limited to, expression of protective antigens from bacterial 
(e.g., uropathogenic E. coli, Streptoccoci, Staphlococci, Nisseria), parasitic (e.g, 
Plasmodium, Leishmania, Toxoplama), fungal (e.g., Candida, Histoplasma) , and viral (e.g., 
HTV, HSV, CMV, influenza) human pathogens, hnmunogenicity of protective antigens 

10 expressed using HCV-derived KNA expression vectors can be enhanced using adjuvants, 

including co-expression of immunomodulatory molecules, such as cytokines (e.g., DL-2, GM- 
CSF) to facilitate development of desired Thl versus Th2 responses. Such adjuvants can be 
either incorporated and co-expressed by HCV vectors themselves or administered in 
combination with these vectors using other methods. 

15 

Diagnostic Methods for Infectious HCV 
Diagnostic cell lines. The invention described herein can also be used to derive cell 
lines for sensitive diagnosis of infectious HCV inpatient samples. In concept, functional 
HCV components are used to test and create susceptible cell lines (as identified above) in 

20 which easily assayed reporter systems are selectively activated upon HCV infection. 
Examples include, but are not restricted to, (i) defective HCV RNAs lacking replicase 
components that are incorporated as transgenes and whose replication is upregulated or 
induced upon HCV infection; and (ii) sensitive heterologous amplifiable reporter systems 
activated by HCV infection. In the first manifestation, KNA signals required for HCV KNA 

25 amplification flank a convenient or a selectable marker (see above). Expression of such 

chimeric RNAs is driven by an appropriate nuclear promoter and elements required for proper 
nuclear processing and transport to the cytoplasm. Upon infection of the engineered cell line 
with HCV, cytoplasmic replication and amplification of the transgene is induced, triggering 
higher levels of reporter expression, as an indicator of productive HCV infection. 

30 In the second example, cell lines are designed for more tightly regulated but highly 

inducible reporter gene amplification and expression upon HCV infection. Although this 
amplfied system is described in the context of specific components, other equivalent 
components can be used. In one such system, an engineered alphavirus replicon transgene is 
created which lacks the alphavirus nsP4 polymerase, an enzyme absolutely required for 

35 alphavirus KNA amplification and normally produced by cleavage from the nonstructural 
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polyprotein. Additional features of this defective alphavirus replicon include a subgenomic 
RNA promoter, driving expression of a luciferase or GFP reporter gene. This promoter 
element is quiescent in the absence of productive cytoplasmic alphavirus replication. The cell 
line contains a second transgene for expression of gene fusion consisting of the HCV NS4A 

5 protein and the alphavirus nsP4 RDRP. This fused gene is expressed and targeted to the 

cytoplasmic membrane compartment, but this form of nsP4 would be inactive as a functional 
component of the alphavirus replication complex because a discrete nsP4 protein, with a 
precise N terminus is required for nsP4 activity [Lemm et al, EMBO J. 13:2925 (1994)]. An 
optional third transgene expresses a defective alphavirus RNA with cis signals for replication, 

1 0 transcription of subgenomic RNA encoding a ubiquitin-nsP4 fusion, and an alphavirus 
packaging signal. Upon infection of such a cell line by HCV, the HCV NS3 proteinase is 
produced, mediating trans cleavage of the NS4A-nsP4 fusion protein, activating the nsP4 
polymerase. This active polymerase, which functions in trans and is effective in minute 
amounts, then forms a functional alphavirus replication complex leading to amplification of 

1 5 the defective alphavirus replicon as well as the defective alphavirus RNA encoding ubiquitin- 
nsP4. Ubiquitin-nsP4, expressed from its subgenomic RNA is cleaved efficiently by cellular 
ubiquitin carboxyterminal hydrolase to product additional nsP4, in case this enzyme is 
limiting. Once activated, this system would produce extremely high levels of the reporter 
protein. The time scale of such an HCV infectivity assay is expected to be from hours (for 

20 sufficient reporter gene expression). 

Antibody diagnostics. In addition to the cell lines described here, HCV variant virus 
particles (virions) or components thereof, produced by the transfected or infected cell lines, or 
isolated from an inflected animal, may be used as antigens to detect anti-HCV antibodies in 
patient blood or blood products. Because the HCV variant virus particles are derived from an 

25 authentic HCV genome, particular components such as the coat proteins are likely to have 

immunogenic properties that more closely resemble or are identical to natural HCV virus than 
if those components were produced outside of a replicating HCV. Examples of such 
immunogenic properties include the display of wild-type HCV immunogenic epitopes, and 
modulation of transcription of genes encoding cellular immune-modulating cytokines. These 

30 reagents can be used to establish that a patient is infected with HCV by detecting 
seroconversion, i.e., generation of a population of HCV-specific antibodies. 

Alternatively, antibodies generated to the HCV variant products prepared as 
described herein can be used to detect the presence of HCV in biological samples from a 
subject. 
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Preferred embodiments of the invention are described in the following example. 
Other embodiments within the scope of the claims herein will be apparent to one skilled in the 
art from consideration of the specification or practice of the invention as disclosed herein. It 
is intended mat the specification, together with the examples, be considered exemplary only, 
5 with the scope and spirit of the invention being indicated by the claims which follow the 
examples. 

Example 1 

This example describes the production and evaluation of replicons comprising a neo 
1 0 selectable marker and a polyprotein coding region encoding subtype lb nonstructural 
proteins. 

Materials and Methods 

Cell lines. The Huh7 cell lines were generously provided by Robert Lanford 
(Southwest Foundation for Biomedical Research, San Antonio, U.S.A.) and Ralf 

15 Bartenschlager (Johannes Gutenberg University Mainz, Mainz, Germany) and maintained in 
Diilbecco's modified minimal essential media (DMEM; Gibco-BRL) supplemented with 10% 
fetal calf serum (FCS), and nonessential amino acids. 

Assembly of a selectable subtype lb replicon. An HCV subtype lb replicon was 
constructed which is similar to the replicon described in Lohmann et al., Science 285: 1 10-1 13 

20 (1 999). For that construction, a step-wise PCR-based assay utilizing KlenTaqLA DNA 

polymerase (Wayne Barnes, Washington University) was developed. cDNAs spanning 600- 
750 bases in length were assembled from 10-12 gel-purified oligonucleotides (60-80 
nucleotides in length) with unique complementary overlaps of 16 nucleotides. Four or six 
oligonucleotides representing the 5' portion of the region to be assembled were annealed and 

25 extended in a standard PCR. The remaining six oligonucleotides for the synthesis of the 3' 
half of the intended cDNA were mixed in a parallel PCR reaction. After 12 cycles of PCR, 
the extended double-stranded DNA products were combined and subjected to an additional 12 
cycles. The product of this reaction resolved as a smear on agarose gels which was excised 
and the DNA isolated from the agarose. One-fifth of the purified double-stranded DNA 

30 product was amplified by PCR using an outer primer pair containing unique restriction 

enzyme sites to facilitate directional cloning into the pGEM3Zf(+) plasmid vector (Promega). 
PCR products were purified, digested with appropriate restriction enzymes, and ligated into 
similarly cleaved pGEM3Zf(+). Multiple recombinant clones were sequenced and the correct 
clones identified. The overlapping cDNA fragments were assembled into the contiguous 

35 replicon sequence. In parallel, a replicon carrying the lethal mutation in the NS5B active site 
(Gly-Asp-Asp [GDD] to Ala-Ala-Gly [AGG]; pol-) was constructed. 
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RNA transcription and transfection. RNA transcripts were synthesized in a lOOul 
reaction mixture containing 40mM Tris-HCl (pH 7.9), lOmM NaCl, 12mM MgCl2, 2mM 
spermidine, 3mM each ATP, CTP, GTP and UTP, lOmM dithiothreitol, 100 U RNasin 
(Promega) and 100 U 17 RNA polymerase (Epicentre), and 2ug Sea /-linearized DNA. The 
5 DNA template was rigorously removed by serial digestions with 30 U DNase I (Boehringer). 
Ten ug of the DNase-digested RNA transcripts were electroporated into 6x 10 6 Huh7 cells 
using a model T820 squareporator (BTX), and plated on 150mm dishes. For selection of 
replicon-containing cells, medium was changed to complete medium containing geneticin 
(G418; lmg/ml; Gibco-BRL) at 24 hr post-transfection and thereafter the media was changed 

10 every 3-4 days. 

KNA analysis. Approximately 5x 10^ cells were preincubated for 1 h in DMEM 
lacking phosphate supplemented with 5% dialyzed FCS, l/20 th the normal concentration of 
phosphate and actinomycin D (4ug/ml; Sigma). [ 32 P]orthophosphate (200uCi/ml; ICN) was 
added and the incubation continued for an additional 12 h. Total cellular RNA was extracted 

15 with TRIZOL, precipitated, and resuspended in H2O (Gibco-BRL). Radiolabeled RNA was 
analyzed by denaturing agarose gel electrophoresis and visualized by autoradiography. 

Protein analysis. For immunoprecipitation, cell monolayers were incubated for 
either 4, 8 or 12 h in methionine- and cysteine-deficient MEM containing 1/40^ the normal 
concentration of methionine, 5% dialyzed FCS and Express 35 S 35 S protein labeling mix 

20 (lOOuCi/ml; NEN). Cells were lysed in lOOmM NaP04 pH 7.0 containing 1% sodium 

dodecyl sulfate (SDS) and protease inhibitors, and cellular DNA sheared by repeated passage 
through a 27.5 gauge needle. Viral proteins were immunoprecipitated essentially as described 
previously (Grakoui eta/, 1993), using patient serum, JHF, recognizing NS3, NS4B and 
NS5A or rabbit anti-NS5B and Pansorbin cells (Calbiochem). Immunoprecipitates were 

25 separated on 1 0% SDS-PAGE and visualized by autoradiography. 

Immunostaining. Cells cultured in 8 well chamber slides (Falcon) were fixed in 
acetone for lOmin at 4°C and allowed to air dry. Rehydrated monolayers were incubated at 
37°C with an antibody directed against NS3, followed by incubation with a species-specific 
fluorescein-conjugated secondary antibody (Pierce), and mounted in 90% glycerol saline 

30 containing 50mM Tris-HCl (pH 8.8). 

Reverse transcription (RT)-PCR. RNA was isolated from cells using TRIZOL 
(Gibco-BRL), precipitated and resuspended in H2O. Levels of HCV RNA were quantitated 
using competitive RT-PCR assays designed to amplify the 5' and 3' NTR sequences of HCV 
(Kolykhalov et al, 1996). For RT-PCR designed to amplify long cDNA fragments, about 

35 1000 molecules of HCV RNA was mixed with the HCV-specific primer, and the primer 

extended at 43.5°C for 1 h using Superscript II reverse transcriptase (Gibco-BRL). cDNAs 
were then amplified with KlenTaqLA DNA polymerase using 35 cycles of 95°C for 30 s, 55- 
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60°C for 30 s, and 68°C for 4 min. PCR products were recovered from preparative low 
melting-point agarose electrophoresis by phenol extraction, and ~40ng of purified PCR 
product directly sequenced. 

5 Results 

Establishment of G418-resistant colonies. Replicons similar to that described in 
Lohmann et al, supra, but derived from the H77 infectious clone, failed to confer resistance to 
G418 in five different hepatoma cell lines. Sequences of subtype lb were also used to 
assemble the replicon I377/NS3-3' (EMBL accession number AJ242652). Replicon RNAs 

10 were composed of the HCV internal ribosome entry site (IRES) driving neomycin 

phosphotransferase gene (Neo) expression and the IRES from encephalomyocarditis virus 
(EMCV), directing translation of HCV proteins NS3 to NS5B, followed by the 3' NTR ) 
(Figure 3). Two derivatives were constructed which either lacked 2 U nucleotides in the poly 
(U/UC) tract or carried an Avail restriction enzyme site in the variable region of the 3' NTR, 

15 designated HCVreplbBartMan/A2U*s and HCVreplbBartMan/Avall, respectively. Prior to 
transfection, translation and correct polyprotein processing was confirmed for each cDNA 
sequence using the vaccinia-T7 RNA polymerase expression system (data not shown). 

DNase-treated replicon RNAs were electroporated into Huh7 cells and after 2-3 
weeks in culture G418-resistant colonies were clearly visible. Both replicon derivatives were 

20 able to confer G418 resistance, and on average, only 1 in 10 6 cells became G418 resistant. In 
contrast, colonies were never observed for Huh7 cells electroporated in parallel with the 
replicon RNAs containing an inactive NS5B polymerase. 

Verification of autonomous replication. Twenty two independent colonies were 
isolated, 5 colonies corresponded to Huh7 cells transfected with RNA transcribed from 

25 HCVreplbBartMan/A2U's and Hie remaining 17 colonies were derived from 

HCVreplbBartMan/AvaH RNA. A number of assays were performed to verify that G418 
resistance was mediated by autonomously replicating HCV. Amplification of sequences 
within the 5' and 3' NTRs in a quantitative RT-PCR assay revealed copy numbers ranging 
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Identification of mutations in HCV replicons. The low frequency of G418- 
resistant colonies may be attributed to either a cell factor(s) requirement for replication or 
adaptive changes within the replicon sequence necessary for the establishment of HCV 
replication. To address the latter possibility, the entire replicon sequence was amplified from 
5 cDNA reverse transcribed from RNA isolated from five independent G41 8-resistant cell 
clones. Upon direct sequencing of the purified PCR population, multiple mutations were 
identified. The striking observation was that each cell clone carried a single nucleotide 
change within NS5 A resulting in a coding change (Figure 7). In one instance, a deletion of 47 
amino acids (I; Figure 7), encompassing the interferon sensitivity determining region (ISDR), 

10 was found. Sequence analysis of NS5A from another 8 G41 8-resistant cell clones revealed 
similar point mutations, although 2 clones, which have low levels of HCV replication and 
slow growth rates (e.g., clone E in Figure 4), were found to contain wild type NS5 A In 
addition to the identified NS5 A mutations, nucleotide substitutions were also noted in NS3 
and NS4B; Clone H (SEQ ID NO:9) contains substitutions at nt 3550 (NS3) and nt 4573 

15 (NS4B) (Lys (584) to Glu, and Ser(925) to Gly of SEQ ID NO:3, embodied in SEQ ID 
NO: 17), whereas nt 2060 (NS3) was mutated in Clone VI (Figure 7, corresponding to Gin 
(87) to Arg of SEQ ID NO:3, embodied in SEQ ID NO: 15). 

Reconstruction of mutant replicons. To determine if the nucleotide changes and 
the deletion identified in NS5A were adaptive, each mutation, except mutation n, was 

20 independently engineered back into the HCVreplbBartMan/Avall backbone. RNA 

transcribed from each reconstructed replicon was electroporated into naive Huh7 cells, and 
the number of G41 8-resistant colonies compared to that obtained for the 
HCVreplbBartMan/Avall replicon containing wild type NS5A. The 47 amino acid deletion, 
as well as the point mutations, were capable of increasing the frequency of G41 8-resistant 

25 colonies to at least 1% of the initial electroporated cell population (Figure 8), indicating these 
mutations targeting NS5A are adaptive allowing efficient HCV replication in Huh7 cells. In 
addition, G41 8-resistant colonies were observed after transfection of HeLa cells, a human 
epithelial cell line, with replicon RNA of clone I. Therefore, at least one of the mutations that 
was adaptive in Huh7 cells also allows the establishment of HCV replication in a non-hepatic 

30 cell line. 

Example 2 

This example describes the production of cell lines permissive for HCV replication; a 
replicon comprising the NS2 coding region; and full-length HCV cDNA clones comprising 
the Ser to lie substitution at position 1179 of SEQ DO NO: 3. 
35 Generation of cell lines. As shown in the previous example, G41 8-resistant cell clones 

harboring persistently replicating HCV RNAs were isolated. Two of these G41 8-resistant cell 
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clones were treated extensively with the antiviral, interferon-a, to obtain 2 cell lines void of 
HCV RNA. These are refered to as interferon-treated cell lines I and II. 

HCVreplbBartMan/Avall, HCV adaptive replicon I or HCV adaptive replicon VII 
were transfected into the interferon-treated cell lines, I and H. This resulted in a greater G418 
5 transduction efficiency than that observed for the parental Huh-7 cells (see Table 1). Early 
post-transfection HCV RNA amplification was greatest for the IFN-treated cell line. These 
results indicate that the cell lines, interferon-treated cell lines I and n, are more permissive for 
HCV replication than is the parental Huh-7 cell line. 

Such cell lines are not only valuable for genetic study of HCV, but also for examining 
10 the cellular environments more permissive for HCV replication. For example, microarray 
technology will allow us to look globally at differences in gene expression profiles between 
the different cell lines. 

Construction of replicons. A replicon was constructed wherein the 5WlR of HCV was 
fused to the IRES of EMCV upstream of NS3, thus creating a replicon lacking the neomycin 

15 phosphotransferase gene. This replicon, 5TSnU-EMCWHCVrepVII (SEQ ID NO:25), 
replicates to high levels in Huh7 cells, as shown in Figure 10. Another replicon, 
HCVrep/NS2-5B (SEQ ID NO:22) was made wherein the non-structural protein, NS2, is 
upstream of NS3. As shown in Figure 10, this replicon is also replication-competent in Huh7 
cells. This latter replicon can be used advantageously, for example, in testing compounds for 

20 inhibiting HCV replication. The addition of the NS2 coding region provides an additional 
target for such antiviral compounds, as well as providing an additional protein for genetic 
study. 

Full-length HCV RNAs. Two full-length HCV cDNA clones were assembled. The first, 
HCV FL (SEQ ID NO:24), contains the mutation that encodes a Ser to He substitution in 

25 NS5A, as shown at position 1 179 of SEQ ID NO:3 (see Figure 9). The second, HCV FL-Neo 
(SEQ ID NO:23), also encodes the Ser to He mutation, and in addition, comprises the 
neomycin phosphotransferase gene immediately 3' of the 5' NTR and the EMCV IRES 
immediately 5' to the HCV open reading frame (see Figure 9). Both of these full-length 
clones replicate in the interferon-treated cell line I, as shown in Figure 10. This result 

30 indicates that HCV replication is not dependent on the EMCV IRES driving the non-structural 
proteins of HCV, because the non-structural proteins of the HCV FL clone are driven by the 
HCV IRES in the full-length clone HCV FL. 

In addition, a G418 resistant cell line comprising the HCV FL-Neo clone has been 
generated from the interferon-treated cell line I described above. This cell line supports high 

35 levels of persistently replicating HCV FL-Neo RNA. 
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All references cited in this specification are hereby incorporated by reference. The 
discussion of the references herein is intended merely to summarize the assertions made by 
the authors and no admission is made that any reference constitutes prior art. Applicants 
reserve the right to challenge the accuracy and pertinence of the cited references. 
5 In view of the above, it will be seen that the several advantages of the invention are 

achieved and other advantages attained. 

As various changes could be made in the above methods and compositions without 
departing from the scope of the invention, it is intended that all matter contained in the above 
description and shown in the accompanying drawings and appendix shall be interpreted as 
10 illustrative and not in a limiting sense. 
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Appendix 
SEQIDNOs 



SEQIDNO-.l : 5' portion of an HCV 5' NTR 

5 

GGCGACACTC CACCATAGAT C 



SEQ ID NO:2 : 3' portion of a 3' NTR from a wild-type HCV subtype la 

10 

TGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGC 
ATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCTGATCATGT 



15 SEQIDNO:3 : Amino acid sequence of the polyprotein region of HCVreplbBartMan 

MAPrTAYSQQmGLLGCIITSLTGRDKISfQVEGEVQWSTATQSFLATCVNGVCWTW 
HGAGSKTLAGPKGP1TQMYTNVDQDLVGWQAPPGARSLTPCTCGSSDLYLVTRHAD 
V1PVPJRRGDSRGSLLSPPJPVSYLKGSSGGPLLCPSGHAVGIERAAVCTRGVAK 

20 PVESMETTMRSPWTDNSSPPAWQTFQVAHLHAPTGSGKSTKWAA 

VLNPSVAATLGFGAYMSKAHGIDPNIRTGVRT1TTGAPITYSTYGKFLADGGCSGGAY 
DmCDECHSTDSTmGIGTVLDQAETAGARLVVLATATPPGSVTVPHPNIEEVALSST 
GEIPFYGKAIPffimGGPJILIFCHSKKKCDELAAKLSGLGLNAVAYYRGLDVSVIPTS 
GDVTVVATDALMTGFTGDFDSVIDCMrc^ 

25 RGRTGRGRMGrYEU«VlPGERPSGMFT)SSV^ 

NIPGLPVCQDHLEFWESVFTGLTEtTOAHFLSQTKQAGDOTPYLVAYQATVCARAQA 
PPPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQ^ 

TSTWVLVGGVLAALAAYCLTTGSVVWGRIILSGKPAIPDP^VLYREFDEMEECASH 
LPYIEQGMQLAEQFKQKAIGLLQTATKQAEAAAPVVESKWRTLEAFWAKHMWOT 

30 GIQYLAGLSTLPGNPAIASL3VLAFTASITSPLTTQHTLLENILGGWVAAQLAPPSAASAF 
VGAGIAGAAVGSIGLGKVLVDILAGYGAGVAGALVAF1CVMSGEMPSTEDLVNLLPA 
ILSPGALVVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTO 
ARVTQrLSSLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTDFKT^ 
LLPRLPGWFFSCQRGYKGVWRGDGMQTTCPCGAQITGHATQ^GSMPJVGPRTCSOT 

35 WHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVTRVGDFHYWG 

CPCQVPAPEFFTEVDGVRLHRYAPAC^LLREEVIFLVGLNQYLVGSQLPCEPEPDV 
AVLTSMLTDPSHITAETAKRRLARGSPPSLASSSASQLSAPSLKATCTTRHDSPDADLI 
EANLLWQEMGGMTRVESFJSfKVmDSFEPLQAEEDEREVSWAEmRRSRE^PP^M 
PIWARPDYNPPLLESWKDPDYWPVVHGCPLPPAKAPPIPPPRPJaiTVVLSESW 

40 AELATKTFGSSESSAVDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSD 
GSWSTVSEEASEDWCCSMSYTWTGALIlPCAAEElEXPINALSNSLLPJfflDN^ 
TSRSASLRQKICVTFDPXQVLDDHYRDVLKEMKAKASTVKAKLLSV^ 
ARSKFGYGAKD VRNLS SKAVNHIRS VWEJDLLEDTETPIDTTIMAKNEVFCVQPEKGG 
RKPARLIVFPDLGWVCEKMALYDVVSTLPQAVMGSSYGFQYSPGQRVEFLVNAWK 

45 AKKCPMGFAYDTRCFDSTVTEM)IRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLT 
NSKGQNCGYRRCPvASGVLTTSCGNTLTCYLKAAAACRAAKLQDCTMLVC 
ICESAGTQEDEASLRAFTEAMmYSAPPGDPPKPEYDLELmCSSlWSVAHDASGKR 
VYYLTRDPTTPLARAAWETARBnPVNSWLGNIIMYAPTLWAR 
QLEKALDCQIYGACYSffiPLDLPQnQRLHGLSAFSLHSYSPGEINRVASCLRKLGVPPL 

50 RVWRHRARSVRARLLSQGGRAATCGKYLFNWAWTiCLKETPffAASQLDLSSWFVA 
GYSGGDrYHSLSRARPRWFMWCLLLLSVGVGIYLLPlSIR 
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SEQ ED NO:4 : Amino acid sequence of the NS5 A protein of HCVreplbBartMan 

5 SGSmRDVWDWICTVLTDFKTWLQSKLIPRLPGWFFSCQRGYKGVWGDGMQTT 
CPCGAQITGHVKNGSMRWGPRTCSNTWHGTFPINAYTTGPC1PSPAPNYS 
AAEEYVEVTRVGDFHYVTGMTTONVKCPCQWAPEFFTEVDGVP^ 
REEVTFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHITAETAERPvLARGSPPSLA 
SSSASQLSAPSLKATCTTRHDSPDADLffiANLLWRQEMGGNITRVESENKVVILDSFE 
1 0 PLQAEEDEREVSVPAEILRRSRIQT^ 

LPPAKAPPIPPPRPJCRTVVLSESTVSSALAELATKTFGSSESSAVDSGTATASPDQPSD 
DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 



1 5 SEQIDNO:5 : Nucleotide sequence of DNA clone of HCVreplbBartMan/A2U , s 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 

20 CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 

25 GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 
GTrCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 

30 TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 
CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 
ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 
GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCGATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 

35 ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 
TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 

40 AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 
TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 
GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 

45 AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 
TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 
CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
GAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTAT 

50 TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 
ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 
ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 
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CCAATGTGGACCAGGACCTCGTCGGCTGGCAAGCGCCCCCCGGGGCGCGTTCCTT 
GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 
GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 
CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 
5 ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 
GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 
GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 
CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 
GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 

10 GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 
CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 
GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 
CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 
CTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCGGGATCGGTCACCGT 

15 GCCACATCCAAACATCGAGGAGGTGGCTCIGTCCAGCACTGGAGAAATCCCCTTT 
TATGGCAAAGCCATCCCCATCGAGACCATCAAGGGGGGGAGGCACCTCATTTTCT 
GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 
CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 
GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG 

20 ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 
CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 
CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 
CCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATG 
ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 

25 GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 
GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 
AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 
CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACG 
GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 

30 CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 
CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 
CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 
CATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAG 
TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 

35 AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 
AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAG 
CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGG 
CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 
TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 

40 GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 
GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 
TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 
GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 
TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 

45 TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 
CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 
ACGTGTCACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 
ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 
ATGT1TGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 

50 CAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTAC 
AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 
CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACACGTGGCATGfGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 
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GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 
GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 
CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 
GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 
5 AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 
ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 
CACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCT 
TGGCCAGCTCATCAGCTAGCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCAC 
TACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGG 

10 CAGGAGATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATT 
TTGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTC 
CGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGG 
CACGCCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGT 
CCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCA 

15 CCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGAATCTACCGTGTCTTCTGCCT 
TGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACA 
GCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGAT 
CCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCC 
CGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGT 

20 CGTCTGCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCT 
GCGGAGGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACC 
ACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGG 
TCACCTTTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGA 
GATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGC 

25 CTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAG 
GACGTCCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGG 
ACTTGCTGGAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATG 
AGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGT 
ATTCCCAGATTTGGGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTC 

30 TCCACCCTCCCTCAGGCCGTGATGGGCTCTTCATACGGATTCCAATACTCTCCTGG 
ACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGG 
CTTCGCATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGT 
GTTGAGGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCA 
TAAGGTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGG 

35 GCAGAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTG 
CGGTAATACCCTCACATGTTACTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAG 
CTCCAGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAA 
GCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGA 
CTAGATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTT 

40 GATAACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGG 
GTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGA 
CAGCTAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCC 
CACCTTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTC 
AGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCAT 

45 TGAGCCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTT 
CACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAA 
ACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCCAGAAGTGTCCGCGCT 
AGGCTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACT 
GGGCAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGA 

50 TTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG 
TCTCGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGT 
AGGCATCTATCTACTCCCCAACCGATGAACGGGGAGCTAAACACTCCAGGCCAAT 
AGGCCATCCTGTTTTTTTCCCTTTTTT^^ 
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TTTTTCTCCTTTTTTTra 

GCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTG 
CTGATACTGGCCTCTCTGCAGATCAAGT 

5 

SEQ ED N0:6 : Nucleotide sequence of DNA clone of HCVreplbBarfMan/Avall, where the 
nucleotide change creating the Avail site is in lower case and highlighted in bold 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 

10 TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 

15 AGACCGTGCACCATGAGCACGAATCCTAAA(XTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 
GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 
GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 

20 ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 
TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 
CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 
ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 

25 GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 
ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 
TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 

30 CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 
AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 
TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 

35 GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 
AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 
TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 

40 CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
GAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTAT 
TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 
ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 
ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 

45 GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 
CCAATGTGGACCAGGACCTCGTCGGCTGGCAAGCGCCCCCCGGGGCGCGTTCCTT 
GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 
GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 
CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 
50 ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 
GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 
GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 
CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 
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GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 

GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 

CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 

GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 

CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 

CTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCGGGATCGGTCACCGT 

GCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTT 

TATGGCAAAGCCATCCCCATCGAGACCATCAAGGGGGGGAGGCACCTCATTTTCT 

GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 

GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG' 

ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 

CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 

CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 

CCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATG 

ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 

GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 

GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 

AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 

CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACG 

GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 

CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 

CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 

CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 

CATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAG 

TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 

AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 

AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGGGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTrAGCAGG 

CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 

TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 

GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 

GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 

TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 

GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 

TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 

TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 

CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 

ACGTGTCACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 

ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 

ATGTTTGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 

CAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTAC 

AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 

CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 

TGTAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 

GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 

GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 

CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 

GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 

AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 

ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 

CACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCT 

TGGCCAGCTCATCAGCTAGCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCAC 
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TACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGG 
CAGGAGATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATT 
TTGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTC 
CGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGG 
5 CACGCCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGT 
CCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCA 
CCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGAATCTACCGTGTCTTCTGCCT 
TGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACA 
GCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGAT 

10 CCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCC 
CGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGT 
CGTCTGCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCT 
GCGGAGGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACC 
ACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGG 

15 TCACCTTTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGA 
GATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGC 
CTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAG 
GACGTCCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGG 
ACTTGCTGGAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATG 

20 AGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGT 
ATTCCCAGATTTGGGGGTrCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTC 
TCCACCCTCCCTCAGGCCGTGATGGGCTCTTCATACGGATTCCAATACTCTCCTGG 
ACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGG 
CTTCGCATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGT 

25 GTTGAGGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCA 
TAAGGTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGG 
GCAGAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTG 
CGGTAATACCCTCACATGTTACTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAG 
CTCCAGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAA 

30 GCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGA 
CTAGATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTT 
GATAACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGG 
GTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGA 
CAGCTAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCC 

35 CACCTTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTC 
AGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCAT 
TGAGCCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTT 
CACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAA 
ACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCCAGAAGTGTCCGCGCT 

40 AGGCTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACT 
GGGCAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGA 
TTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG 
TCTCGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGT 
AGGCATCTATCTACTCCCCAACCGATGAACGGGGAcCTAAACACTCCAGGCCAAT 

45 AGGCCATCCTGTTTTTTTCCCTTTTTTTT^ 
TTTTTITCTCCTTTTITI^ 

TAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAG 
TGCTGATACTGGCCTCTCTGCAGATCAAGT 

50 

SEQIDNO:7 : Nucleotide sequence of DNA clone of HCV adaptive replicon I, where the 
amino acid generated by the deletion is identified in lower case and highlighted in bold 
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GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
5 GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 
GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 

10 GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 
TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGGGGCTGCATACGCTTGAT 

15 CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 
ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 
GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 
ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 

20 TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 
AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 

25 TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTGGCCAAAGGAATGCAAGGTCTGTTGAATGTC 
GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 
AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 

30 TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 
CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
GAACCACGGGGACGTGGTTTTCC1TTGAAAAACACGATAATACCATGGCGCCTAT 
TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 

35 ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 
ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 
CCAATGTGGACCAGGACCTCGTCGGCTGGCAAGCGCCCCCCGGGGCGCGTTCCTT 
GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 

40 GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 
CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 
ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 
GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 
GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 

45 CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 
GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 
GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 
CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 
GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 

50 CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 
CTGGAGCGCGACTCGTCGTGCTGGCCACCGCTACGCCTCCGGGATCGGTCACCGT 
GCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTT 
TATGGCAAAGCCATCCCCATCGAGACCATCAAGGGGGGGAGGCACCTCATTTTCT 
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GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 
CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 
GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG 
ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 
5 CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 
CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 
CCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATG 
ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 
GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 

10 GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 
AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 
CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACG 
GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 
CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 

15 CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 
CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 
CATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAG 
TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 
AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 

20 AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAG 
CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGG 
CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 
TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 
GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 

25 GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 
TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 
GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 
TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 
TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 

30 CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 
ACGTGTCACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 
ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 
ATGTTTGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 
CAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTAC 

35 AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 
CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 
GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 
GGAGTACGTGGAGGTrACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 

40 CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 
GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 
AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 
ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 
CACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCT 

45 TGGCCAGCTCATCAGCTAGCCAGCTGtacTCTTTCGAGCCGCTCCAAGCGGAGGAG 
GATGAGAGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTC 
CCTCGAGCGATGCCCATATGGGCACGCCCGGATTACAACCCTCCACTGTTAGAGT 
CCTGGAAGGACCCGGACTACGTCCCTCCAGTGGTACACGGGTGTCCATTGCCGCC 
TGCCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCA 

50 GAATCTACCGTGTCTTCTGCCTTGGCGGAGCTCGCCACAAAGACCTTCGGCAGCT 
CCGAATCGTCGGCCGTCGACAGCGGCACGGCAACGGCCTCTCCTGACCAGCCCTC 
CGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTT 
GAGGGGGAGCCGGGGGATCCCGATCTCAGCGACGGGTCTTGGTCTACCGTAAGC 



WO 01/89364 



PCT/US01/16822 



74 

GAGGAGGCTAGTGAGGACGTCGTCTGCTGCTCGATGTCCTACACATGGACAGGC 
GCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAATGCACTG 
AGCAACTCTTTGCTCCGTCACCACAACTTGGTCTATGCTACAACATCTCGCAGCG 
CAAGCCTGCGGCAGAAGAAGGTCACCTTTGACAGACTGCAGGTCCTGGACGACC 
5 ACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACAGTTAAGGCTA 
AACTTCTATCCGTGGAGGAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATC 
TAAATTTGGCTATGGGGCAAAGGACGTCCGGAACCTATCCAGCAAGGCCGTTAA 
CCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGAGACACCAATTGAC 
ACCACCATCATGGCAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGC 

10 CGCAAGCCAGCTCGCCTTATCGTATTCCCAGATTTGGGGGTTCGTGTGTGCGAGA 
AAATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAGGCCGTGATGGGCTCTTCA 
TACGGATTCCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGA 
AAGCGAAGAAATGCCCTATGGGCTTCGCATATGACACCCGCTGTTTTGACTCAAC 
GGTCACTGAGAATGACATCCGTGTTGAGGAGTCAATCTACCAATGTTGTGACTTG 

15 GCCCCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGG 
GGCCCCCTGACTAATTCTAAAGGGCAGAACTGCGGCTATCGCCGGTGCCGCGCGA 
GCGGTGTACTGACGACCAGCTGCGGTAATACCCTCACATGTTACTTGAAGGCCGC 
TGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGGAGAC 
GACCTTGTCGTTATCTGTGAAAGCGCGGGGACCCAAGAGGACGAGGCGAGCCTA 

20 CGGGCCTTCACGGAGGCTATGACTAGATACTCTGCCCCCCCTGGGGACCCGCCCA 
AACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTCAGTCGC 
GCACGATGCATCTGGCAAAAGGGTGTACTATCTCACCCGTGACCCCACCACCCCC 
CTTGCGCGGGCTGCGTGGGAGACAGCTAGACACACTCCAGTCAATTCCTGGCTAG 
GCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATCCTGATGACTCA 

25 TTTCTTCTCCATCCTTCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGA 
TCTACGGGGCCTGTTACTCCATTGAGCCACTTGACCTACCTCAGATCATTCAACG 
ACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTACTCTCCAGGTGAGATCAATA 
GGGTGGCTTCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACA 
TCGGGCCAGAAGTGTCCGCGCTAGGCTACTGTCCCAGGGGGGGAGGGCTGCCAC 

30 TTGTGGCAAGTACCTCTTCAACTGGGCAGTAAGGACCAAGCTCAAACTCACTCCA 
ATCCCGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGG 
GGGAGACATATATCACAGCCTGTCTCGTGCCCGACCCCGCTGGTTCATGTGGTGC 
CTACTCCTACTTTCTGTAGGGGTAGGCATCTATCTACTCCCCAACCGATGAACGG 
GGACCTAAACACTCCA 

TTCTTTCCTTTGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCC 
GTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGATCAAGT 



40 SEQ ED NO:8 : Nucleotide sequence ofDNA clone of HCV adaptive replicon VI, where 
nucleotide changes are in lower case and highlighted in- bold 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 

45 CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 

50 GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 
GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 
GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
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ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 

GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 

TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 

CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 

ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 

GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 

GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 

ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 

TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 

GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 

CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 

CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 

AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 

TGCCGTCTITrGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 

CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 

GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 

ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 

AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 

TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 

GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 

CGGTGK^ACATGCTTTACATGTGTITAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 

GAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTAT 

TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 

ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 

GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 

CCAATGTGGACCAGGACCTCGTCGGCTGGCgAGCGCCCCCCGGGGCGCGTTCCTT 

GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 

GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 

CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 

ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 

GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 

GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 

CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 

GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 

GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 

CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 

GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 

CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 

CTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCGGGATCGGTCACCGT 

GCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTT 

TATGGCAAAGCCATCCCCATCGAGA(XATCAAGGGGGGGAGGCACCTCATTTTCT 

GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 

GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG 

ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 

CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 

CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 

CCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATG 

ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 

GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 

GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 

AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 
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CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACG 
GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 
CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 
CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 
5 CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 
CATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAG 
TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 
AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 
AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAG 

10 CCTTCTGGGGGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGG 
CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 
TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 
GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 
GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 

15 TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 
GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 
TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 
TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 
CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 

20 ACGTGTCACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 
ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 
ATGTTTGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 
CAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTAC 
AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 

25 CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 
GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 
GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 
CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 

30 GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 
AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 
ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 
CACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCT 
TGGCCAGCTCATCAGCTAtCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCACT 

35 ACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGGC 
AGGAGATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTT 
TGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTC 
CGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGG 
CACGCCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGT 

40 CCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCA 

TGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACA 
GCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGAT 
CCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCC 

45 CGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGT 
CGTCTGCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCT 
GCGGAGGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACC 
ACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGG 
TCACCTTTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGA 

50 GATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGC 
CTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAG 
GACGTCCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGG 
ACTTGCTGGAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATG 
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AGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGT 
ATTCCCAGATTTGGGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTC 
TCCACCCTCCCTCAGGCCGTGATGGGCTCTTCATACGGATTCCAATACTCTCCTGG 
ACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGG 
5 CTTCGCATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGT 
GTTGAGGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCA 
TAAGGTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGG 
GCAGAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTG 
CGGTAATACCCTCACATGTTACTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAG 

10 CTCCAGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAA 
GCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGA 
CTAGATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTT 
GATAACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGG 
GTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGA 

15 CAGCTAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCC 
CACCTTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTC 
AGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCAT 
TGAGCCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTT 
CACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAA 

20 ACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCCAGAAGTGTCCGCGCT 
AGGCTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACT 
GGGCAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGA 
TTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG 
TCTCGTGCCCGACCCCGCTGG1TCATGTGGTGCCTACTCCTACTTTCTGTAGGGGT 

25 AGGCATCTATCTACTCCCCAACCGATGAACGGGGAGCTAAACACTCCAGGCCAAT 
AGGCCATCCTGTTTTTTTCCCTTITITTTm 
TTTTTCTCCTTTTTTTCT 

GCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTG 
CTGATACTGGCCTCTCTGCAGATCAAGT 

30 

SEQIDNO:9 : Nucleotide sequence of DNA clone of HCV adaptive replicon II, where 
nucleotide changes are in lower case and highlighted in bold 

35 GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 

40 AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 
GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 
GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 

45 GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 
TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 
CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 

50 ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 
GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 
ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 
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TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATrCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 
5 AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 
TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 
GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 

10 AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 
TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 
CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
GAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTAT 

15 TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 
ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 
ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 
CCAATGTGGACCAGGACCTCGTCGGCTGGCAAGCGCCCCCCGGGGCGCGTTCCTT 

20 GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 
GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 
CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 
ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 
GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 

25 GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 
CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 
GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 
GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 
CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 

30 GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 
CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 
CTGGAGCGCGACTCGTCG1XKTOGCCACCGCTACGCCTCCGGGATCGGTCACCGT 
GCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTT 
TATGGCAAAGCCATCCCCATCGAGACCATCAAGGGGGGGAGGCACCTCATTTTCT 

35 GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 
CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 
GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG 
ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 
CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 

40 CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 
CCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATG 
ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 
GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 
GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 

45 AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 
CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGgAGTGTCTCATACG 
GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 
CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 
CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 
50 CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 
CATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAG 
TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 
AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 



WO 01/89364 



PCT/US01/16822 



79 

AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTQGCGGACCCTCGAAG 
CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGG 
CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 
TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 
5 GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 
GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 
TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 
GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 
TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 
10 TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 
CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 
ACGTGTCACTCAGATCCTCTCTgGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 
ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 
ATGTTTGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 
15 CAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTAC 
AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 
CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 
GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 
20 GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 
CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 
GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 
AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 
ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 
25 CACATTACGGCGGAGACGGCTAAGCGTgGGCTGGCCAGGGGATCTCCCCCCTCCT 
TGGCCAGCTCATCAGCTAGCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCAC 
TACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGG 
CAGGAGATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATT 
TrGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTC 
30 CGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGG 
CACGCCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGT 
CCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCA 
CCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGAATCTACCGTGTCTTCTGCCT 
TGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACA 
35 GCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGAT 
CCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCC 
CGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGT 
CGTCTGCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCT 
GCGGAGGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACC 
40 ACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGG 
TCACCTTTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGA 
GATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGC 
CTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAG 
GACGTCCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGG 
45 ACTTGCTGGAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATG 
AGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGT 
ATTCCCAGATTTGGGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTC 
TCCACCCTCCCTCAGGCCGTGATGGGCTCTTCATACGGATTCCAATACTCTCCTGG 
ACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGG 
50 CTrCGCATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGT 
GTTGAGGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCA 
TAAGGTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGG 
GCAGAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTG 
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CGGTAATACCCTCACATGTTACTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAG 
CTCCAGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAA 
GCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGA 
CTAGATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTT 
5 GATAACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGG 
GTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGA 
CAGCTAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCC 
CACCTTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTC 
AGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCAT 

10 TGAGCCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTT 
CACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAA 
ACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCCAGAAGTGTCCGCGCT 
AGGCTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACT 
GGGCAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGA 

15 TTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG 
TCTCGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGT 
AGGCATCTATCTACTCCCCAACCGATGAACGGGGACCTAAACACTCCAGGCCAAT 
AGGCCATCCTGTTTTTTTCCCTTTTTTTT^ 
TTTTTTTCTCCTTTTTTTTTCCTCTT^ 

20 TAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAG 
TGCTGATACTGGCCTCTCTGCAGATCAAGT 



SEQIDNO:10 : Nucleotide sequence of DNA clone of HCV adaptive replicon V, where 
25 nucleotide change is in lower case and highlighted in bold 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 

30 CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 

35 GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 
GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 

40 TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 
CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 
ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 
GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 

45 ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 
TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 

50 AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 
TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 
GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
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ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 
AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 
TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 
5 CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
GAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTAT 
TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 
ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 
ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG . 

10 GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 
CCAATGTGGACCAGGACCTCGTCGGCTCjGCAAGCGCCCCCCGGGGCGCGTTCCTT 
GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 
GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 
CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 

15 ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 
GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 
GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 
CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 
GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 

20 GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 
CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 
GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 
CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 
CTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCGGGATCGGTCACCGT 

25 GCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTT 
TATGGCAAAGCCATCCCCATCGAGACCATCAAGGGGGKjGAGGCACCTCATTTTCT 
GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 
CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 
GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG 

30 ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 
CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 
CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 
CCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATG 
ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 

35 GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 
GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 
AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 
CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACG 
GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 

40 CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 
CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 
CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 
CATCTTGTCCGGAAAGCCGGCCAtCATTCCCGACAGGGAAGTCCTTTACCGGGAG 
TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 

45 AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 
AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAG 
CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGG 
CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 
TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 

50 GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 
GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 
TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 
GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 
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TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 
TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 
CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 
ACGTGTCACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 
5 ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 
ATGTTTGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 

AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 
CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 

10 TGTAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 
GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 
GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 
CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 
GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 

15 AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 
ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 
CACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCT 
TGtCCAGCTCATCAGCTAGCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCACT 
ACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGGC 

20 AGGAGATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTT 
TGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTC 
CGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGG 
CACGCCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGT 
CCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCA 

25 CCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGAATCTACCGTGTCTTCTGCCT 
TGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACA 
GCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGAT 
CCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCC 
CGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGT 

30 CGTCTGCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCT 
GCGGAGGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACC 
ACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGG 
TCACCTTTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGA 
GATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGC 

35 CTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAG 
GACGTCCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGG 
ACTTGCTGGAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATG 
AGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGT 
ATTCCCAGATTTGGGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTC 

40 TCCACCCTCCCTCAGGCCGTGATGGGCTCTTCATACGGATTCCAATACTCTCCTGG 
ACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGG 
CTTCGCATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGT 
GTTGAGGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCA 
TAAGGTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGG 

45 GCAGAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTG 
CGGTAATACCCTCACATGTTAGTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAG 
CTCCAGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAA 
GCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGA 
CTAGATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTT 

50 GATAACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGG 
GTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGA 
CAGCTAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCC 
CACCTTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTC 
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AGK3AACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCAT 
TGAGCCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTT 
CACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAA 
ACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGGCAGAAGTGTCCGCGCT 
5 AGGCTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACT 
GGGCAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGA 
TTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG 
TCTCGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGT 
AGGCATCTATCTACTCCCCAACCGATGAACGGGGACCTAAACACTCCAGGCCAAT 
10 AGGCCATCCTGTTTTTITCCCTTTTITITm 
TTTTTTTCTCCTTTTTTTT^ 

TAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAG 
TGCTGATACTGGCCTCTCTGCAGATCAAGT 

15 

SEQIDNO:!! : NS5A gene of DNA clone of HCV adaptive replicon IV, where nucleotide 
change is in lower case and highlighted in bold 

TCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGATT 

20 TCAAGACCTGGCTCCAGTCCAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTT 
CTCATGTCAACGTGGGTACAAGGGAGTCTGGCGGGGCGACGGCATCATGCAAAC 
CACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTCCATGAG 
GATCGTGGGGCCTAGGACCTGTAGTAACACGTGGCATGGAACATTCCCCATTAAC 
GCGTACACCACGGGCCCCTGCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGC 

25 TGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCC 
ACTACGTGACGGGCATGACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGC 
CCCCGAATTCTTCACAGAAGTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCG 
TGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGGGCTCAATCAATACC 
TGGTTGGGTCACAGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTC 

30 CATGCTCACCGACCCCTCCCACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCC 
AGGGGATCTCCCCCCTgCTTGGCCAGCTCATCAGCTAGCCAGCTGTCTGCGCCTTC 
CTTGAAGGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAG 
GCCAACCTCCTGTGGCGGCAGGAGATGGGCGGGAACATCACCCGCGTGGAGTCA 
GAAAATAAGGTAGTAATTTTGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGAT 

35 GAGAGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCT 
CGAGCGATGCCCATATGGGCACGCCCGGATTACAACCCTCCACTGTTAGAGTCCT 
GGAAGGACCCGGACTACGTCCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGC 
CAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGA 
ATCTACCGTGTCTTCTGCCTTGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCC 

40 GAATCGTCGGCCGTCGACAGCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCG 
ACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGA 
GGGGGAGCCGGGGGATCCCGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGA 
GGAGGCTAGTGAGGACGTCGTCTGCTGC 

45 

SEQIDNO: 12 : NS5A gene of HCV adaptive replicon HI, where nucleotide change is in 
lower case and highlighted in bold 

TCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGATT 
50 TCAAGACCTGGCTCCAGTCCAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTT 
CTCATGTCAACGTGGGTACAAGGGAGTCTGGCGGGGCGACGGCATCATGCAAAC 
CACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTCCATGAG 
GATCGTGGGGCCTAGGACCTGTAGTAACACGTGGCATGGAACATTCCCCATTAAC 
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GCGTACACCACGGGCCCCTGCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGC 
TGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCC 
ACTACGTGACGGGCATGACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGC 
CCCCGAATTCTTCACAGAAGTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCG 
5 TGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGGGCTCAATCAATACC 
TGGTTGGGTCACAGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTC 
CATGCTCACCGACCCCTCCCACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCC 
AGGGGATCTCCCCCCcCCTTGGCCAGCTCATCAGCTAGCCAGCTGTCTGCGCCTTC 
CTTGAAGGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAG 

10 GCCAACCTCCTGTGGCGGCAGGAGATGGGCGGGAACATCACCCGCGTGGAGTCA 
GAAAATAAGGTAGTAATTTTGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGAT 
GAGAGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCT 
CGAGCGATGCCCATATGGGCACGCCCGGATTACAACCCTCCACTGTTAGAGTCCT 
GGAAGGACCCGGACTACGTCCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGC 

15 CAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGA 
ATCTACCGTGTCTTCTGCCTTGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCC 
GAATCGTCGGCCGTCGACAGCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCG 
ACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGA 
GGGGGAGCCGGGGGATCCCGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGA 

20 GGAGGCTAGTGAGGACGTCGTCTGCTGC 



SEQIDNO:13 : Nucleotide sequence of DNA clone of HCV adaptive replicon VH, where 
nucleotide change is in lower case and highlighted in bold 

25 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 

30 GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 
GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 

35 GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 
TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 

40 CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 
ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAG 
GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 
ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 

45 TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTTTAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 
AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 

50 TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 
GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 
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AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 
TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 
. CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
5 GAACCACGGGGACGTGGTTTTCC1TKJAAAAACACGATAATACCATGGCGCCTAT 
TACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTC 
ACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCA 
ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACA 

10 CCAATGTGGACCAGGACCTCGTCGGCTGGCAAGCGCCCCCCGGGGCGCGTTCCTT 
GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATGCCGAT 
GTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGG 
CCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGC 
ACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGT 

15 GGACTTTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACG 
GACAACTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACG 
CCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAG 
GGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGC 
GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGAC 

20 CATCACCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGAC 
GGTGGTTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAA 
CTGACTCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGG 
CTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCGGGATCGGTCACCGT 
GCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTT 

25 TATGGCAAAGCCATCCCCATCGAGACCATCAAGGGGGGGAGGCACCTCATTTTCT 
GCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACT 
CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGA 
GACGTCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCG 
ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGA 

30 CCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCG 
CAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACT 
CCAGGAGAACGGCCCKX3GG(^TGTTCGATTCCT 

ACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCG 
GGCTTACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGG 

35 GAGAGCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTA 
AGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCG 
CCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACG 
GCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTT 
CAAAACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGT 

40 CGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAG 
CAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGAT 
CATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAG 
TTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGC 
AGCTCGCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCA 

45 AGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAG 
CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGG 
CTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCC 
TCTATCACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGG 
GATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGTAGGCGCC 

50 GGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATA 
TTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCAT 
GAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTC 
TCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACG 
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TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTT 
CGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGC 
ACGTGTCACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTC 
ACCAGTGGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAG 
5 ATGTTTGGGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTC 
CAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTAC 
AAGGGAGTCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCA 
CAGATCACCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCT 

10 GCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGA 
GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 
CACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAA 
GTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGG 
AGGAGGTCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCC 

15 ATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCC 
CACATTACGGCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCT 
TGGCCAGCTCATCAGCTAtCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCACT 
ACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGGC 
AGGAGATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTT 

20 TGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTC 
CGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGG 
CACGCCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGT 
CCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCA 
CCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAGAATCTACCGTGTCTTCTGCCT 

25 TGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACA 
GCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGAT 
CCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCC 
CGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGT 
CGTCTGCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCT 

30 GCGGAGGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACC 
ACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGG 
TCACCTTTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGA 
GATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGC 
CTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAG 

35 GACGTCCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGG 
ACTTGCTGKjAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATG 
AGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGT 
ATTCCCAGATTTGGGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTC 
TCCACCCTCCCTCAGGCCGTGATGGGCrCTTCATACGGATTCCAATACTCTCCTGG 

40 ACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGG 
CTTCGCATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGT 
GTTGAGGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCA 
TAAGGTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGG 
GCAGAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTG 

45 CGGTAATACCCTCACATGTTACTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAG 
CTCCAGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAA 
GCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGA 
CTAGATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTT 
GATAACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGG 

50 GTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGA 
CAGCTAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCC 
CACCTTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTC 
AGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCAT 
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TGAGCCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTT 
CACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAA 
ACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCCAGAAGTGTCCGCGCT 
AGGCTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACT 
5 GGGCAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGA 
TTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG 
TCTCGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGT 
AGGCATCTATCTACTCCCCAACCGATGAACGGGGAGCTAAACACTCCAGGCCAAT 
AGGCCATCCTGTTTTTTTCCCT^^ 
10 TTTTTCTCCTTTTTTTTTCCTCTTT^ 

GCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTG 
CTGATACTGGCCTCTCTGCAGATCAAGT 



15 SEQIDNO:14 : Amino acid sequence of the NS5A protein of HCV adaptive replicon I, 
where amino acid generated is highlighted in bold 

SGSWLRDVWDWICTVLTDFKTWLQSKXLPRLPGWFFSCQRGYKGVWRGDGMQTT 
CPCGAQITGHViaNrGSMRWG^ 
20 AAEEYVEVTRVGDFHYVTGMTTDlSrvTCCPCQWAPEFFTC 

PJEEVTFLVGLNQYLVGSQIJ'CEPEPDVAVLTSMLTDPSHITAETAKRRLARGSPPSLA 
SSSASQLYSFEPLQAEEDEREVSWAEILRRSPJH^PRAMPIWARPDYNPPLLESWKDP 
DYWPVVHGCPLPPAKAPPPPPRRKRTVVLSES 

TATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 

25 

SEQIDNO:15 : Amino acid sequence of the polyprotein coding region of HCV adaptive 
replicon VI, where amino acid changes are highlighted in bold 

30 MAPrTAYSQQTRGLLGCirrSLTGRDP^ 

HGAGSKTLAGPKGP^^QMYT^m)QDLVGWRAPPGARSLTPCTCGSSDLYLVIliH^ 

VIPVRElRGDSRGSLLSPPvPVSYLKGSSGGPLLCPSGHAVGIFRAAVCTRGVAKAVDFV 

PVESMETTMRSPVFTDNSSPPAWQ1PQVAHLHAPTGSGKSTKWAAYAAQGYKVL 

VLNPSVAATLGFGAYMSKAHGIDPMRTGVETITTGA^ 
35 DniODECHSTDSTmGIGTVLDQAETAG 

GEIPFYGKAIPffiTIKGGPJHLrFCHSKKXCDELAAKLSGLGLNAVAYYRGLDVSVIPTS 

GDVTVVATDALMTGFTGDFDSVIDCOTCrVTQTV^ 

RGRTGRGRMGIYRFVIPGERPSGMFDSSVLCEC^ 

lOTGLPVCQDHLEFWESVFTGLTEnDAHFLSQTKQAGDl^YLVAYQAWCARAQA 
40 PPPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEVTTTHPITKYIMACMSA^ 

TSTWVLVGKSVLAALAAYCLTTGSVVWGRIILSGKPAIIPDP^VLYREFDEMEECASH 
LPYffiQGMQIAEQFKQKAIGLI^TATKQAEAAAPVVESKmTLEAFWAKHMWNFIS 
GIQYLAGLSTLPGNPAIASLMAFTASITSPLTTQHTLLFNILGGWVAAQLAPPSAASAF 
VGAGIAGAAVGSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVNLLPA 
45 ILSPGALWGWCAAILPJIHVGPGEGAVQWMNPXIAFASRGNIWSPTHYVPESDAA 
ARVTQILSSLTITQLLKPXHQWINEDCSTPCSGSWLPJ3VWDWICTVLTO 
LLPPXPGWFFSCQRGYKGVWRGDGIMQTTCP^ 
WHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVTRVGDFHYW 
CPCQWAPEFFTEVDGVRLHRYAPAC^LLREEVTFLVGLNQYLVGSQLPCEPEPDV 
50 AVLTSMLTDPSHITAETAKPJILARGSPPSLASSSAIQLSAPSLKATCTTRHDSPDADLI 
EAM.LWQEMGGMTRVESENKVmDSFEPLQAEEDEMVSWAEn.RRSRKFPRAM 
PIWARPDYNPPLLESWKDPDYWPVVHGCPLPPAKAPPIPPPRRKRTVVLSESW 
AELATKTFGSSESSAVDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSD 
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GSWSTVSEEASEDWCCSMSYTWTGALI1PCAAEETKXPMALSNSLLRHHNLVYAT 
TSRSASLRQKKVTFDRLQVLDDHYRDVLKEMKAEASTVKAKLLSVEEACKLITPHS 
ARSKTGYGAKDVRM.SSKAVNH1PVSVWKDLLEDTE 

RKPARLrvTPDLGVRVCEKMALYDVVSTLPQAVMGSSYGFQYSPGQRVEFLVNAWK 
5 AKKCPMGFAYDmCFDSTVIENDIRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLT 
NSKGQNCGYRRCRASGVLTTSCGNTLTC^^ 

ICESAGTQEDEASLRAFTEAMTRYSAPPGDPPKPEYDLELITSCSSNVSVABDASGKR 
VYYLTRDPTTPLAEAAWETARHTPVNSWLGNIEMYAPTLW 

QLEKALDCQIYGACYSffiPLDLPQnQIUJIGLSAFSLHSYSPGEINRVASCLRKLGVPPL 
10 RVWRHRARSVRAPJ.LSQGGRAATCGKYLFNWAWTKLKLTPIPAASQLDLSSWFVA 
GYSGGDIYHSLSRARPRWFMWCLLLLSVGVGIYLLPNR 



SEQH)NO:16 : Amino acid sequence of the NS5A protein of HCV adaptive replicon VII, 
1 5 where amino acid change is highlighted in bold 

SGSWLRDVWDWICTVTTDFKTW^ 

CPCGAQITGHVKNGSMRWGPRTCSNTWHGTFPINAYTTGPC1PSPAPNYSRALWRV 
AAEEYVEVTRVGDFHYVTGMTTONVTCCPCQWAPEFFIEVD 
20 PxEEVlFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSfflTAETAKRRLARGSPPSLA 
SSSMQLSAPSLKATCTTRHDSPDADLmAM.LWQEMGGNITRVESENKVVILDSFEP 
LQAEEDEREVSWAEILRRSRKFPRAMPrWARPDYNPPLLESWKDPDYVPPVVHGCP 
LPPAKAPPPPPRRKRTVVLSESTVSSALAELATKTFGSSESSAVDSGTATASPDQPSD 
DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 

25 

SEQIDNO:17 : Amino acid sequence of the polyprotein of HCV adaptive replicon E, where 
amino acid changes are highlighted in bold 

30 MAPITAYSQQTRGLLGCirrSLTGRDRNQVEGEVQWSTATQSFLATCWGVCWTVY 
HGAGSKTLAGPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGSSDLYLVTPvHAD 
VIPVRRRGDSRGSLLSPRPVSYLKGSSGGPLLCPSGHAVGIFRAAVC1RGVAKAVDFV 
PVESMETTMRSPWTONSSPPAWQTFQVAHLHAPTGSGKSmWAAYAAQGYKVL 
W.NPSVAATLGFGAYMSKAHGroPNIRTGWllTTGAPrrYSTYGKFLADGGCSGGAY 

35 DIHCDECHSTDST1TLGIGTVXDQAETAGARLVVLATATPPGSVTW 

GEIPFYGKAIPffimGGPJILIFCHSKKKCDELAAKLSGLGLNAVAYYRGLDVSVn'TS 
GDVIWATDALMTGFTGDFDSV1DCNTCW 

RGRTGRGRMGrYRFVTPGERPSGMFDSSVLCECYDAGCAWYELTPAETSVRLRAYL 
NTPGLPVCQDHLEFWESVFTGLTEnDAHFLSQTKQAGDOTPYLVAYQATVCARAQA 
40 PPPSWDQMWECLmLKTTLHGPTPLLYRLGAVQNEVTTTHPimYIMA 
TSTWVLVGGVLAALAAYCLTTGSVVTVX^ 

LPYIEQGMQLAEQFKQKAIGLLQTAlKQAEAAAPVVESKWTLEAFWAKHMWNnS 
GIQYLAGLSTLPGNPAIASLMAFTASrTSPLTTQHTLLFNILGGWVAAQLAPPSAASAF 
VGAG1AGAAVGSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTBDLVNLLPA 

45 ILSPGALWGWCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAA 
ARWQILSGLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTDFKTWLQSK 
LLPPaPGVPFFSCQRGYKGVWRGDGMQTTCPCGAQITGHVKNGSMRIVGPRTCSNT 
WHGIFPmAYTTGPCTPSPAPNYSRALWVAAEEYV^VTRVGDFHYVTGMTTDlWK 
CPCQWAPEFFTEVDG\^RLHRYAPACKPLLREEVTFLVGLNQYLVGSQLPCEPEPDV 

50 AVLTSMLTDPSHITAETAKRGLARGSPPSLA5SSASQLSAPSLKATCTTRHDSPDADLI 
EANLLWRQEMGGNITRVESENKVmDSFEPLQAEEDEREVSWAEILRRSRETPRAM 
PrWARPDYl^PLLESWKDPDYWPVVHGCP^ 

AELATKTFGSSESSAVDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSD 
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GSWSTVSEEASEDWCCSMSYTWTGAIJIPCAAEETKLPINALSNSLLRHHNLVYAT 
TSRSASLRQKKVTFDRLQVLDDHYRDVLKEMKAICA.STVKAKLLSVEEACKL1PPHS 
ARSKFGYGAKDVRNLSSKAVNEnRSVWKDLLEDTETPIDT^ 

RKPARLIVFRDLGWVCEKMALYDVVSTLPQAVMGSSYGFQYSPGQRVEFLVNAWK 
5 AKKCPMGFAYDTRCFDSTVTEM)IRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLT 
NSKGQNCGYRRCRASGVLTTSCGOTLTCYLKAAAACRAAEXQDCTMLVCGD 
ICESAGTQEDEASLRAFTEAMIRYSAPPGDPPKPEYDLELITSCSS1WSVAHDASGKR 
VYYLTRDPTTPLARAAWETARHTPVNSWLGNIIMYAPTLW 

QLEKALDCQIYGACYSIEPLDLPQnQRLHGLSAFSLHSYSPGEINRVASCLRKLGVPPL 
10 RVWFJHRARSVRARLLSQGGRAATCGKYLF^ 

GYSGGDIYHSLSRARPRWFMWCLLLLSVGVGIYLLPNR 



SEQIDNO:18 : Amino acid sequence of the NS5A protein of HCV adaptive replicon H, 
15 where amino acid change is highlighted in bold 

SGSWLRDVWDWICTVLTDFKTWLQSKLLPRLPGWFESCQRGYKGVWRGDG 
CPCGAQJTGHVKNGSMRWGPRTCSNTO 
AAEEYVEVTRVGDFHYVTGMTTDN^ 
20 REEVTFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHTTAETAKRGLARGSPPSLA 
SSSASQLSAPSLKATCTTPJfflDSPDADLffiAmLWQEMGGNTmVESENKVVm 
PLQAEEDEREVSWAEILRRSRKFPPvAMPIWARPDYNPPLLESWKDPDYVPPVVHGCP 
LPPAKAPPffPPRRKRTVVXSESWSSALAELA'rKTFGSSESSAVDSGTATASPDQPSD 
DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 

25 

SEQIDNO:19 : Amino acid sequence of the NS5A protein of HCV adaptive replicon V, 
where amino acid, change is highlighted in bold 

30 SGSmRDVWDWICTVXTDFKTWLQSKLLPRLPGWFFSCQRGYKGVWRGDGnNlQTT 
CPCGAQITGHVKNGSMPJVGPRTCSNTWHGTFPnNlAYTTGPCTPSPAPNYSRALWRV 
AAEEYVEVTRVGDFHYVTGMTTONVKCPCQWAPEFFTEVDGVRLHRYA^ 
REEVTFLVGLNQYLVGSQLPCEPEPDVAvXTSMLTDPSHITAETAKRRLARGSPPSLS 
SSSASQLSAPSLKATCriRHDSPDADLIEANLLWQEMGGNITRVESEmVVILDSFE 

35 PLQAEEDEREVSWAEILRRSRKFPRAMPIWARPDYNPPIXESWKDPDYWPVV^ 
LPPAKAPPIPPPRRKRTVVLSESWSSALAELATKTFGSSESSAVDSGTATASPDQPSD 
DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 



40 SEQK)NO:20 : Amino acid sequence of the NS5A protein of HCV adaptive replicon IV, 
where amino acid change is highlighted in bold 

SGSWLRDVWDWICTVLTDFKTWL 

CPCGAQITGHVKNGSMPJVGPRTCSNTWHGTFPINAYTTGPCTPSPAPNYSPvALWV 
45 AAEEYVEVTRVGDFHYVTGMTTDNVKCPCQWAPEFFIEVDGVRLm 

REEVTFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHrTAETAKRRLARGSPPCLA 

SSSASQLSAPSLl^TCTTRHDSPDADLIEANLLWR 

PLQAEEDEREVSWAEILRRSRKFPRAMPIWARPDY]^ 

LPPAKAPPIPPPRRKIITV^SESWSSALAELAIICTFGSSESSAVDSGTATASPDQPSD 
50 DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 
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SEQIDNO:21 : Amino acid sequence of the NS5A protein of HCV adaptive replicon IE, 
where amino acid change is highlighted in bold 

SGSWLRDVWDWICTVLTDFKTmQSKLLPRLPGWFFSCQRGYKGVWRGDGIMQTT 
5 CPCGAQITGHVKNGSMiaVGPRTCSmiraGl¥PmYTTGPClPSPAPNYSRALWRV 
AAEEYVEVTRVGDFHYWGMT^ 

REEVTFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSfflTAETAKRRLARGSPPPLA 
SSSASQLSAPSLKATCTTRHDSPDADLffiAM.LWRQEMGGlOTllVESENKVVILDSFE 
PLQAEEDEREVSWAErLRRSRKTPRAMPrWARPDYNPPLLESWKDPDYVPPVVHGCP 
10 LPPAKAPPIPPPPJRKRTVVLSESWSSALAELATKTFGSSESSAVDSGTATASPDQPSD 
DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 

SEQIDNO:22: Nucleotide sequence of DNA clone of HCV adaptive replicon HCVrep/NS2- 
5B (see Figure 9) 

15 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 

20 GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCAGACCACAACGGTTTCCCTCTAGCGGGATCAATTCCGCCCCTC 
TCCCTCCCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGT 
GCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCITrTGGCAATGTGAGGGC 

25 CCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCG 
CCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAG 
CTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCC 
ACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGC 
AAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGT 

30 CAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGT 
ACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTT 
AGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTT 
TGAAAAACACGATAATACCATGGACCGGGAGATGGCAGCATCGTGCGGAGGCGC 
GGTTTTCGTAGGTCTGATACTCTTGACCTTGTCACCGCACTATAAGCTGTTCCTCG 

35 CTAGGCTCATATGGTGGTTACAATATTTTATCACCAGGGCCGAGGCACACTTGCA 
AGTGTGGATCCCCCCCCTCAACGTTCGGGGGGGCCGCGATGCCGTCATCCTCCTC 
ACGTGCGCGATCCACCCAGAGCTAATCTTTACCATCACCAAAATCTTGCTCGCCA 
TACTCGGTCCACTCATGGTGCTCCAGGCTGGTATAACCAAAGTGCCGTACTtCGT 
GCGCGCACACGGGCTCATTCGTGCATGCATGCTGGTGCGGAAGGTTGCTGGGGGT 

40 CATTATGTCCAAATGGCTCTCATGAAGTTGGCCGCACTGACAGGTACGTACGTTT 
ATGACCATCTCACCCCACTGCGGGACTGGGCCCACGCGGGCCTACGAGACCTTGC 

GGGGCAGACACCGCGGCGTGTGGGGACATCATCTTGGGCCTGCCCGTCTCCGCCC 
GCAGGGGGAGGGAGATACATCTGGGACCGGCAGACAGCCTTGAAGGGCAGGGG 

45 TGGCGACTCCTCGCGCCTATTACGGCCTACTCCCAACAGACGCGAGGCCTACTTG 
GCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGGGGGAGG 
TCCAAGTGGTCTCCACCGCAACACAATCTTTCCTGGCGACCTGCGTCAATGGCGT 
GTGTTGGACTGTCTATCATGGTGCCGGCTCAAAGACCCTTGCCGGCCCAAAGGGC 
CCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCGGCTGGCAAGCG 

50 CCCCCCGGGGCGCGTTCCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACT 
TGGTCACGAGGCATGCCGATGTCATTCCGGTGCGCCGGCGGGGCGACAGCAGGG 
GGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGCTCTTCGGGCGGTCC 
ACTGCTCTGCCCCTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACC 
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CGAGGGGTTGCGAAGGCGGTGGACTTTGTACCCGTCGAGTCTATGGAAACCACTA 
TGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTCCGGCCGTACCGCAGACATT 
CCAGGTGGCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCC 
GGCTGCGTATGCAGCCCAAGGGTATAAGGTGCTTGTCCTGAACCCGTCCGTCGCC 
5 GCCACCCTAGGTTTCGGGGCGTATATGTCTAAGGCACATGGTATCGACCCTAACA 
TCAGAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTA 
TGGCAAGTTTCTTGCCGACGGTGGTTGCTCTGGGGGCGCCTATGACATCATAATA 
TGTGATGAGTGCCACTCAACTGACTCGACCACTATCCTGGGCATCGGCACAGTCC 
TGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGC 

10 CTCCGGGATCGGTCACCGTGCCACATCCAAACATCGAGGAGGTGGCTCTGTCCAG 
CACTGGAGAAATCCCCTTTTATGGCAAAGCCATCCCCATCGAGACCATCAAGGGG 
GGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTCGCCGCGA 
AGCTGTCCGGCCTCGGACTCAATGCTGTAGCATATTACCGGGGCCTTGATGTATC 
CGTCATACCAACTAGCGGAGACGTCATTGTCGTAGCAACGGACGCTCTAATGACG 

15 GGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACATGTGTCACCCAGA 
CAGTCGACTTCAGCCTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCAC 
AAGACGCGGTGTCACGCTCGCAGCGGCGAGGCAGGACTGGTAGGGGCAGGATGG 
GCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTC 
GGTTCTGTGCGAGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCC 

20 GAGACCTCAGTTAGGTTGCGGGCTTACCTAAACACACCAGGGTTGCCCGTCTGCC 
AGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACAGGCCTCACCCACATAGACGC 
CCATTTCTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCA 
TACCAGGCTACGGTGTGCGCCAGGGCTCAGGCTCCACCTCCATCGTGGGACCAAA 
TGTGGAAGTGTCTCATACGGCTAAAGCCTACGCTGCACGGGCCAACGCCCCTGCT 

25 GTATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAA 
ATACATCATGGCATGCATGTCGGCTGACCTGGAGGTCGTCACGAGCACCTGGGTG 
CTGGTAGGCGGAGTCCTAGCAGCTCTGGCCGCGTATTGCCTGACAACAGGCAGCG 
TGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCGACAG 
GGAAGTCCTTTACCGGGAGTTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCT 

30 TACATCGAACAGGGAATGCAGCTCGCCGAACAATTCAAACAGAAGGCAATCGGG 
TTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAATCC 
AAGTGGCGGACCCTCGAAGCCITCtGGGCGAAGCATATGTGGAAm 
GGATACAATATTTAGCAGGCTTGTCCACTCTGCCTGGCAACCCCGCGATAGCATC 
ACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACCCAACATACCCTC 

35 CTGTTTAACATCCTGGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCT 
GCTTCTGCTTTCGTAGGCGCCGGCATCGCTGGAGCGGCTGTTGGCAGCATAGGCC 
TTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCGC 
GCTCGTGGCCTTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTT 
AACCTACTCCCTGCTATCCTCTCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCGC 

40 AGCGATACTGCGTCGGCACGTGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGAA 
CCGGCTGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTG 
CCTGAGAGCGACGCTGCAGCACGTGTCACTCAGATCCTCTCTAGTCTTACCATCA 
CTCAGCTGCTGAAGAGGCTTCACCAGTGGATCAACGAGGACTGCTCCACGCCATG. 
CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGAT 

45 TTCAAGACCTGGCTCCAGTCCAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCTT 
CTCATGTCAACGTGGGTACAAGGGAGTCTGGCGGGGCGACGGCATCATGCAAAC 
CACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTCCATGAG 
GATCGTGGGGCCTAGGACCTGTAGTAACACGTGGCATGGAACATTCCCCATTAAC 
GCGTACACCACGGGCCCCTGCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCGC 

50 TGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCC 
ACTACGTGACGGGCATGACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCC 
GGCCCCCGAATTCTTCACAGAAGTGGATGGGGTGCGGTTGCACAGGTACGCTCCA 
GCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGGGCTCAATCAAT 



WO 01/89364 



PCT/US01/16822 



92 

ACCTGGTTGGGTCACAGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCAC 
TTCCATGCTCACCGACCCCTCCCACATTACGGCGGAGACGGCTAAGCGTAGGCTG 
GCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCTATCCAGCTGTCTGCGC 
CTTCCTTGAAGGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCAT 
5 CGAGGCCAACCTCCTGTGGCGGCAGGAGATGGGCGGGAACATCACCCGCGTGGA 
GTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCGAGCCGCTCCAAGCGGAGGAG 
GATGAGAGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTC 
CCTCGAGCGATGCCCATATGGGCACGCCCGGATTACAACCCTCCACTGTTAGAGT 
CCTGGAAGGACCCGGACTACGTCCCTCCAGTGGTACACGGGTGTCCATTGCCGCC 

10 TGCCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCA 
GAATCTACCGTGTCTTCTGCCTTGGCGGAGCTCGCCACAAAGACCTTCGGCAGCT 
CCGAATCGTCGGCCGTCGACAGCGGCACGGCAACGGCCTCTCCTGACCAGCCCTC 
CGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTT 
GAGGGGGAGCCGGGGGATCCCGATCTCAGCGACGGGTCTTGGTCTACCGTAAGC 

15 GAGGAGGCTAGTGAGGACGTCGTCTGCTGCTCGATGTCCTACACATGGACAGGC 
GCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAATGCACTG 
AGCAACTCTTTGCTCCGTCACCACAACTTGGTCTATGCTACAACATCTCGCAGCG 
CAAGCCTGCGGCAGAAGAAGGTCACCTTTGACAGACTGCAGGTCCTGGACGACC 
ACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACAGTTAAGGCTA 

20 AACTTCTATCCGTGGAGGAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATC 
TAAATTTGGCTATGGGGCAAAGGACGTCCGGAACCTATCCAGCAAGGCCGTTAA 
CCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGAGACACCAATTGAC 
ACCACCATCATGGCAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGC 
CGCAAGCCAGCTCGCCTTATCGTATTCCCAGATTTGGGGGTTCGTGTGTGCGAGA 

25 AAATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAGGCCGTGATGGGCTCTTCA 
TACGGATTCCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGA 
AAGCGAAGAAATGCCCTATGGGCTTCGCATATGACACCCGCTGTTITGACTCAAC 
GGTCACTGAGAATGACATCCGTGTTGAGGAGTCAATCTACCAATGTTGTGACTTG 
GCCCCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGG 

30 GGCCCCCTGACTAATTCTAAAGGGCAGAACTGCGGCTATCGCCGGTGCCGCGCGA 
GCGGTGTACTGACGACCAGCTGCGGTAATACCCTCACATGTTACTTGAAGGCCGC 
TGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGGAGAC 
GACCTTGTCGTTATCTGTGAAAGCGCGGGGACCCAAGAGGACGAGGCGAGCCTA 
CGGGCCTTCACGGAGGCTATGACTAGATACTCTGCCCCCCCTGGGGACCCGCCCA 

35 AACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTCAGTCGC 
GCACGATGCATCTGGCAAAAGGGTGTACTATCTCACCCGTGACCCCACCACCCCC 
CTTGCGCGGGCTGCGTGGGAGACAGCTAGACACACTCCAGTCAATTCCTGGCTAG 
GCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATCCTGATGACTCA 
TTTCTrCTCCATCCTrCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGA 

40 TCTACGGGGCCTGTTACTCCATTGAGCCACTTGACCTACCTCAGATCATTCAACG 
ACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTACTCTCCAGGTGAGATCAATA 
GGGTGGCTTCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACA 
TCGGGCCAGAAGTGTCCGCGCTAGGCTACTGTCCCAGGGGGGGAGGGCTGCCAC 
TTGTGGCAAGTACCTCTTCAACTGGGCAGTAAGGACCAAGCTCAAACTCACTCCA 

45 ATCCCGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGG 
GGGAGACATATATCACAGCCTGTCTCGTGCCCGACCCCGCTGGTTCATGTGGTGC 
CTACTCCTACTTTCTGTAGGGGTAGGCATCTATCTACTCCCCAACCGATGAACGG 
GGACCTAAACACTCCAGGCCAATAGGCCATCCTG^ 
TTTTTTTTTTTTTTTT^^ 

50 TTCTTTCCTTTGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCC 
GTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGATCAAGT 
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SEQIDNO:23: Nucleotide sequence of full-length HCV cDNA clone containing the 
mutation that results in Ser to lie at position 1 1 79 of SEQ ID NO:3, and where the 5' NTR is 
fused to the neomycin phosphotransferase gene and the EMCV IRES is inserted upstream of 
the HCV open reading frame (see Figure 9) 

5 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCGCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 

10 GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAGGGCGC 
GCCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGA 
GGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT 

15 GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCC 
GGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACG 
ACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACT 
GGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC 
TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGAT 

20 CCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGT 
ACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGGATCAG 
GGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGC 
GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 
ATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTA 

25 TCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 
CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGTITAAACAGACCACAACGGTTTCC 
CTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCG 
AAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATAT 

30 TGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAG 
CATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC 
GTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCG 
ACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAA 
AGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTG 

35 TGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAA 
GGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCT 
CGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCC 
GAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATAATGAGCACGAAT 
CCTAAACCTCAAAGAAAAACCAAACGTAACACCAACCGCCGCCCACAGGACGTC 

40 AAGTTCCCGGGCGGTGGTCAGATCGTCGGTGGAGTTTACCTGTTGCCGCGCAGGG 
GCCCCAGGTTGGGTGTGCGCGCGACTAGGAAGACTTCCGAGCGGTCGCAACCTC 
GTGGAAGGCGACAACCTATCCCCAAGGCTCGCCAGCCCGAGGGTAGGGCCTGGG 
CTCAGCCCGGGTACCCCTGGCCCCTCTATGGCAATGAGGGCTTGGGGTGGGCAGG 
ATGGCTCCTGTCACCCCGTGGCTCTCGGCCTAGTTGGGGCCCCACGGACCCCCGG 

45 CGTAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTCACGTGCGGCTTCGCCG 
ATCTCATGGGGTACATTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGC 
CCTGGCGCATGGCGTCCGGGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAA 
TCTGCCCGGTTGCTCCTlTrCTATCTTCCTTTTGGCTTTGCTGTCCTGTTTGACCA 
CCCAGCTTCCGCTTATGAAGTGCGCAACGTATCCGGAGTGTACCATGTCACGAAC 

50 GACTGCTCCAACGCAAGCATTGTGTATGAGGCAGCGGACATGATCATGCATACCC 
CCGGGTGCGTGCCCTGCGTTCGGGAGAACAACTCCTCCCGCTGCTGGGTAGCGCT 
CACTCCCACGCTCGCGGCCAGGAACGCTAGCGTCCCCACTACGACGATACGACGC 
CATGTCGATTTGCTCGTTGGGGCGGCTGCTCTCTGCTCCGCTATGTACGTGGGAG 
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ATCTCTGCGGATCTGTTTTCCTCGTCGCCCAGCTGTTCACCTTCTCGCCTCGCCGG 
CACGAGACAGTACAGGACTGCAATTGCTCAATATATCCCGGCCACGTGACAGGTC 
ACCGTATGGCTTGGGATATGATGATGAACTGGTCACCTACAGCAGCCCTAGTGGT 
ATCGCAGTTACTCCGGATCCCACAAGCTGTCGTGGATATGGTGGCGGGGGCCCAT 
5 TGGGGAGTCCTAGCGGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAGG 
TTCTGATTGTGATGCTACTCTTTGCCGGCGTTGACGGGGGAACCTATGTGACAGG 
GGGGACGATGGCCAAAAACACCCTCGGGATTACGTCCCTCTTTTCACCCGGGTCA 
TCCCAGAAAATCCAGCTTGTAAACACCAACGGCAGCTGGCACATCAACAGGACT 
GCCCTGAACTGCAATGACTCCCTCAACACTGGGTTCCTTGCTGCGCTGTTCTACGT 

10 GCACAAGTTCAACTCATCTGGATGCCCAGAGCGCATGGCCAGCTGCAGCCCCATC 
GACGCGTTCGCTCAGGGGTGGGGGCCCATCACTTACAATGAGTCACACAGCTCGG 
ACCAGAGGCCTTATTGTTGGCACTACGCACCCCGGCCGTGCGGTATCGTACCCGC 
GGCGCAGGTGTGTGGTCCAGTGTACTGCTTCACCCCAAGCCCTGTCGTGGTGGGG 
ACGACCGACCGGTTCGGCGTCCCTACGTACAGTTGGGGGGAGAATGAGACGGAC 

15 GTGCTGCTTCITAACAACACGCGGCCGCCGCAAGGCAACTGGTTTGGCTGTACAT 
GGATGAATAGCACTGGGTTCACCAAGACGTGCGGGGGCCCCCCGTGTAACATCG 
GGGGGATCGGCAATAAAACCTTGACCTGCCCCACGGACTGCTTCCGGAAGCACC 
CCGAGGCCACTTACACCAAGTGTGGTTCGGGGCCTTGGTTGACACCCAGATGCTT 
GGTCCACTACCCATACAGGCTTTGGCACTACCCCTGCACTGTCAACTTTACCATCT 

20 TCAAGGTTAGGATGTACGTGGGGGGAGTGGAGCACAGGCTCGAAGCCGCATGCA 
ATrGGACTCGAGGAGAGCGTTGTAACCTGGAGGACAGGGACAGATCAGAGCTTA 
GCCCGCTGCTGCTGTCTACAACGGAGTGGCAGGTATTGCCCTGTTCCTTCACCAC 
CCTACCGGCTCTGTCCACTGGTTTGATCCATCTCCATCAGAACGTCGTGGACGTAC 
AATACCTGTACGGTATAGGGTCGGCGGTTGTCTCCTTTGCAATCAAATGGGAGTA 

25 TGTCCTGTTGCTCTTCCTTCTTCTGGCGGACGCGCGCGTCTGTGCCTGCTTGTGGA 
TGATGCTGCTGATAGCTCAAGCTGAGGCCGCCCTAGAGAACCTGGTGGTCCTCAA 
CGCGGCATCCGTGGCCGGGGCGCATGGCATTCTCTCCTTCCTCGTGTTCTTCTGTG 
CTGCCTGGTACATCAAGGGCAGGCTGGTCCCTGGGGCGGCATATGCCCTCTACGG 
CGTATGGCCGCTACTCCTGCTCCTGCTGGCGTTACCACCACGAGCATACGCCATG 

30 GACCGGGAGATGGCAGCATCGTGCGGAGGCGCGGTTTTCGTAGGTCTGATACTCT 
TGACCTTGTCACCGCACTATAAGCTGTTCCTCGCTAGGCTCATATGGTGGTTACAA 
TATTTTATCACCAGGGCCGAGGCACACTTGCAAGTGTGGATCCCCCCCCTCAACG 
TTCGGGGGGGCGGCGATGCCGTCATCCTCCTCACGTGCGCGATCCACCCAGAGCT 
AATCTTTACCATCACCAAAATCTTGCTCGCCATACTCGGTCCACTCATGGTGCTCC 

35 AGGCTGGTATAACCAAAGTGCCGTACTTCGTGCGCGCACACGGGCTCATTCGTGC 
ATGCATGCTGGTGCGGAAGGTTGCTGGGGGTCATTATGTCCAAATGGCTCTCATG 
AAGTTGGCCGCACTGACAGGTACGTACGTTTATGACCATCTCACCCCACTGCGGG 
ACTGGGCCCACGCGGGCCTACGAGACCTTGCGGTGGCAGTTGAGCCCGTCGTCTT 
CTCTGATATGGAGACCAAGGTTATCACCTGGGGGGCAGACACCGCGGCGTGTGG 

40 GGACATCATCTTGGGCCTGCCCGTCTCCGCCCGCAGGGGGAGGGAGATACATCTG 
GGACCGGCAGACAGCCTTGAAGGGCAGGGGTGGCGACTCCTCGCGCCTATTACG 
GCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTCACAG 
GCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCAACAC 
AATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATGGTGCC 

45 GGCTCAAAGACCCTTGCCGGCCCAAAGGGCCCAATCACCCAAATGTACACCAAT 
GTGGACCAGGACCTCGTCGGCTGGCAAGCGCCCCCCGGGGCGCGTTCCTTGACAC 
CATGCACCTGCGGCAGCTCGGACCnTACTTGGTCACGAGGCATGCCGATGTCAT 
TCCGGTGCGCCGGCGGGGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGT 
CTCCTACTTGAAGGGCTCTTCGGGCGGTCCACTGCTCTGCCCCTCGGGGCACGCT 

50 GTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGCGAAGGCGGTGGACT 
TTGTACCCGTCGAGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAA 
CTCGTCCCCTCCGGCCGTACCGCAGACATTCCAGGTGGCCCATCTACACGCCCCT 
ACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCGTATGCAGCCCAAGGGTAT 
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AAGGTGCTTGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATA 
TGTCTAAGGCACATGGTATCGACCCTAACATCAGAACCGGGGTAAGGACCATCA 
CCACGGGTGCCCCCATCACGTACTCCACCTATGGCAAGTTTCTTGCCGACGGTGG 
TTGCTCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGAC 
5 TCGACCACTATCCTGGGCATCGGCACAGTCCTGGACCAAGCGGAGACGGCTGGA 
GCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCGGGATCGGTCACCGTGCCAC 
ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATGG 
CAAAGCCATCCCCATCGAGACCATCAAGGGGGGGAGGCACCTCATTTTCTGCCAT 
TCCAAGAAGAAATGTGATGAGCTCGCCGCGAAGCTGTCCGGCCTCGGACTCAAT 

10 GCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGGAGACG 
TCATTGTCGTAGCAACGGACGCTCTAATGACGGGCTTTACCGGCGATTTCGACTC 
AGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACTTCAGCCTGGACCCG 
ACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCACGCTCGCAGC 
GGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAG 

15 GAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTTCTGTGCGAGTGCTATGACGC 
GGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAGTTAGGTTGCGGGCT 
TACCTAAACACACCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGA 
GCGTCTTTACAGGCCTCACCCACATAGACGCCCATTTCTTGTCCCAGACTAAGCA 
GGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAGGCTACGGTGTGCGCCAG 

20 GGCTCAGGCTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTA 
AAGCCTACGCTGCACGGGCCAACGCCCCTGCTGTATAGGCTGGGAGCCGTTCAAA 
ACGAGGTTACTACCACACACCCCATAACCAAATACATCATGGCATGCATGTCGGC 
TGACCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCT 
CTGGCCGCGTATTGCCTGACAACAGGCAGCGTGGTCATTGTGGGCAGGATCATCT 

25 TGTCCGGAAAGCCGGCCATCATTCCCGACAGGGAAGTCCTTTACCGGGAGTTCGA 
TGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCAGCTC 
GCCGAACAATTCAAACAGAAGGCAATCGGGTTGCTGCAAACAGCCACCAAGCAA 
GCGGAGGCTGCTGCTCCCGTGGTGGAATCCAAGTGGCGGACCCTCGAAGCCTTCT 
GGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAGGCTTGTC 

30 CACTCTGCCTGGCAACCCCGCGATAGCATCACTGATGGCATTCACAGCCTCTATC 
ACCAGCCCGCTCACCACCCAACATACCCTCCTGTTTAACATCCTGGGGGGATGGG 
TGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTrCTGCTTTCGTAGGCGCCGGCATC 
GCTGGAGCGGCTGTTGGCAGCATAGGCCTTGGGAAGGTGCITGTGGATATTTTGG 
CAGGTTATGGAGCAGGGGTGGCAGGCGCGCTCGTGGCCTTTAAGGTCATGAGCG 

35 GCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTCCCTGCTATCCTCTCCCCT 
GGCGCCCTAGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCC 
CAGGGGAGGGGGCTGTGCAGTGGATGAACCGGCTGATAGCGTTCGCTTCGCGGG 
GTAACCACGTCTCCCCCACGCACTATGTGCCTGAGAGCGACGCTGCAGCACGTGT 
CACTCAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGT 

40 GGATCAACGAGGACTGCTCCACGCCATGCTCCGGCTCGTGGCTAAGAGATGTTTG 
GGATTGGATATGCACGGTGTTGACTGATTTCAAGACCTGGCTCCAGTCCAAGCTC 
CTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGGGAG 
TCTGGCGGGGCGACGGCATCATGCAAACCACCTGCCCATGTGGAGCACAGATCA 
CCGGACATGTGAAAAACGGTTCCATGAGGATCGTGGGGCCTAGGACCTGTAGTA 

45 ACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTGCACGCC 
CTCCCCGGCGCCAAATTATTCTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTAC 
GTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGACCACTGAC 
AACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACAGAAGTGGATG 
GGGTGCGGTTGCACAGGTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGG 

50 TCACATTCCTGGTCGGGCTCAATCAATACCTGGTTGGGTCACAGCTCCCATGCGA 
GCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCGACCCCTCCCACATT 
ACGGrCGGAGACGGCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCC 
AGCTCATCAGCTATCCAGCTGTCTGCGCCTTCCTTGAAGGCAACATGCACTACCC 
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GTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAACCTCCTGTGGCGGCAGGA 
GATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGA 
CTCTTTCGAGCCGCTCCAAGCGGAGGAGGATGAGAGGGAAGTATCCGTTCCGGC 
GGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGAGCGATGCCCATATGGGCACG 
5 CCCGGATTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCT 
CCAGTGGTACACGGGTGTCCATTGCCGCCTGCCAAGGCCCCTCCGATACCACCTC 
CACGGAGGAAGAGGACGGTTGTCCTGTCAGAATCTACCGTGTCTTCTGCCTTGGC 
GGAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCCCTCCGACGACGGCGACGCGGGATCCGA 
CGTTGAGTCGTACTCCTCCATGCCCCCCCTTGAGGGGGAGCCGGGGGATCCCGAT 
CTCAGCGACGGGTCTTGGTCTACCGTAAGKXjAGGAGGCTAGTGAGGACGTCGTCT 
GCTGCTCGATGTCCTACACATGGACAGGCGCCCTGATCACGCCATGCGCTGCGGA 
GGAAACCAAGCTGCCCATCAATGCACTGAGCAACTCTTTGCTCCGTCACCACAAC 
TTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAAGGTCACCT 
TTGACAGACTGCAGGTCCTGGACGACCACTACCGGGACGTGCTCAAGGAGATGA 
AGGCGAAGGCGTCCACAGTTAAGGCTAAACTTCTATCCGTGGAGGAAGCCTGTA 
AGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGGGCAAAGGACGT 
CCGGAACCTATCCAGCAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGGACTTG 
CTGGAAGACACTGAGACACCAATTGACACCACCATCATGGCAAAAAATGAGGTT 
TTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCTCGCCTTATCGTATTCC 
CAGATTTGGGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTCTCCAC 
CCTCCCTCAGGCCGTGATGGGCTCTTCATACGGATTCCAATACTCTCCTGGACAG 
CGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAAGAAATGCCCTATGGGCTTCG 
CATATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGTGTTGA 
GGAGTCAATCTACCAATGTTGTGACTTGGCCCCCGAAGCCAGACAGGCCATAAG 
GTCGCTCACAGAGCGGCTTTACATCGGGGGCCCCCTGACTAATTCTAAAGGGCAG 
AACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTGCGGT 
AATACCCTCACATGTTACTTGAAGGCCGCTGCGGCCTGTCGAGCTGCGAAGCTCC 
AGGACTGCACGATGCTCGTATGCGGAGACGACCTTGTCGTTATCTGTGAAAGCGC 
GGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGACTAG 
ATACTCTGCCCCCCCTGGGGACCCGCCCAAACCAGAATACGACTTGGAGTTGATA 
ACATCATGCTCCTCCAATGTGTCAGTCGCGCACGATGCATCTGGCAAAAGGGTGT 
ACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGGAGACAGC 
TAGACACACTCCAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGCCCACC 
TTGTGGGCAAGGATGATCCTGATGACTCATTTCTTCTCCATCCTTCTAGCTCAGGA 
ACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTGTTACTCCATTGAG 
CCACTTGACCTACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTTCACT 
CCATAGTTACTCTCCAGGTGAGATCAATAGGGTGGCTTCATGCCTCAGGAAACTT 
GGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCCAGAAGTGTCCGCGCTAGG 
CTACTGTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACTGGG 
CAGTAAGGACCAAGCTCAAACTCACTCCAATCCCGGCTGCGTCCCAGTTGGATTT 
ATCCAGCTGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTGTCT 
CGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGTAGG 
CATCTATCTACTCCCCAACCGATGAACGGGGACCTAAACACTCCAGGCCAATAGG 
CCATCCTGTTTTTTTCCCTTTTTTTTTO 
TTTTCTCCTTITTTTTTC 

CCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGC 
TGATACTGGCCTCTCTGCAGATCAAGT 



SEQIDNO:24: Nucleotide sequence of full-length HCV cDNA clone containing the 
mutation that results in Ser to lie at position 1179 of SEQ ID NO:3 (see Figure 9) 
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GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 
5 GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAAC 
ACCAACCGCCGCCCACAGGACGTCAAGTTCCCGGGCGGTGGTCAGATCGTCGGT 
GGAGTTTACCTGTTGCCGCGCAGrGGGCCCCAGGTTGGGTGTGCGCGCGACTAGGA 

10 AGACTTCCGAGCGGTCGCAACCTCGTGGAAGGCGACAACCTATCCCCAAGGCTC 
GCCAGCCCGAGGGTAGGGCCTGGGCTCAGCCCGGGTACCCCTGGCCCCTCTATGG 
CAATGAGGGCTTGGGGTGGGCAGGATGGCTCCTGTCACCCCGTGGCTCTCGGCCT 
AGTTGGGGCCCCACGGACCCCCGGCGTAGGTCGCGCAATTTGGGTAAGGTCATCG 
ATACCCTCACGTGCGGCTTCGCCGATCTCATGGGGTACATTCCGCTCGTCGGCGC 

15 CCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAGGA 
CGGCGTGAACTATGCAACAGGGAATCTGCCCGGTTGCTCCTTTTCTATCTTCCTTT 
TGGCTTTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAAGTGCGCAACGTA 
TCCGGAGTGTACCATGTCACGAACGACTGCTCCAACGCAAGCATTGTGTATGAGG 
CAGCGGACATGATCATGCATACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACA 

20 ACTCCTCCCGCTGCTGGGTAGCGCTCACTCCCACGCTCGCGGCCAGGAACGCTAG 
CGTCCCCACTACGACGATACGACGCCATGTCGATTTGCTCGTTGGGGCGGCTGCT 
CTCTGCTCCGCTATGTACGTGGGAGATCTCTGCGGATCTGTTTTCCTCGTCGCCCA 
GCTGTTCACCTTCTCGCCTCGCCGGCACGAGACAGTACAGGACTGCAATtGCTCA 
ATATATCCCGGCCACGTGACAGGTCACCGTATGGCTTGGGATATGATGATGAACT 

25 GGTCACCTACAGCAGCCCTAGTGGTATCGCAGTTACTCCGGATCCCACAAGCTGT 
CGTGGATATGGTGGCGGGGGCCCATTGGGGAGTCCTAGCGGGCCTTGCCTACTAT 
TCCATGGTGGGGAACTGGGCTAAGGTTCTGATTGTGATGCTACTCTTTGCCGGCG 
TTGACGGGGGAACCTATGTGACAGGGGGGACGATGGCCAAAAACACCCTCGGGA 
TTACGTCCCTCTTTTCACCCGGGTCATCCCAGAAAATCCAGCTTGTAAACACCAA 

30 CGGCAGCTGGCACATCAACAGGACTGCCCTGAACTGCAATGACTCCCTCAACACT 
GGGTTCCTTGCTGCGCTGTTCTACGTGCACAAGTTCAACTCATCTGGATGCCCAG 
AGCGCATGGCCAGCTGCAGCCCCATCGACGCGTTCGCTCAGGGGTGGGGGCCCA 
TCACTTACAATGAGTCACACAGCTCGGACCAGAGGCCTTATTG1TGGCACTACGC 
ACCCCGGCCGTGCGGTATCGTACCCGCGGCGCAGGTGTGTGGTCCAGTGTACTGC 

35 TTCACCCCAAGCCCTGTCGTGGTGGGGACGACCGACCGGTTCGGCGTCCCTACGT 
ACAGTTGGGGGGAGAATGAGACGGACGTGCTGCTTCTTAACAACACGCGGCCGC 
CGCAAGGCAACTGGTTTGGCTGTACATGGATGAATAGCACTGGGTTCACCAAGAC 
GTGCGGGGGCCCCCCGTGTAACATCGGGGGGATCGGCAATAAAACCTTGACCTG 
CCCCACGGACTGCTTCCGGAAGCACCCCGAGGCCACTTACACCAAGTGTGGTTCG 

40 GGGCCTTGGTTGACACCCAGATGCITGGTCCACTACCCAtACAGGCTTTGGCACT 
ACCCCTGCACTGTCAACTTTACCATCTTCAAGGTTAGGATGTACGTGGGGGGAGT 
GGAGCACAGGCTCGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTAACCT 
GGAGGACAGGGACAGATCAGAGCTTAGCCCGCTGCTGCTGTCTACAACGGAGTG 
GCAGGTATTGCCCTGTTCCTTCACCACCCTACCGGCTCTGTCCACTGGTTTGATCC 

45 ATCTCCATCAGAACGTCGTGGACGTACAATACCTGTACGGTATAGGGTCGGCGGT 
TGTCTCCTTTGCAATCAAATGGGAGTATGTCCTGTTGCTCTTCCTTCTTCTGGCGG 
ACGCGCGCGTCTGTGCCTGCTTGTGGATGATGCTGCTGATAGCTCAAGCTGAGGC 
CGCCCTAGAGAACCTGGTGGTCCTCAACGCGGCATCCGTGGCCGGGGCGCATGG 
CATTCTCTCCITCCTCGTGTTCTTCTGTGCTGCCTGGTACATCAAGGGCAGGCTGG 

50 TCCCTGGGGCGGCATATGCCCTCTACGGCGTATGGCCGCTACTCCTGCTCCTGCTG 
GCGTTACCACCACGAGCATACGCCATGGACCGGGAGATGGCAGCATCGTGCGGA 
GGCGCGGTTTTCGTAGGTCTGATACTCTTGACCTTGTCACCGCACTATAAGCTGTT 
CCTCGCTAGGCTCATATGKjTGGTTACAATATTTTATCACCAGGGCCGAGGCACAC 



WO 01/89364 



PCT/US01/16822 



98 

TTGCAAGTGTGGATCCCCCCCCTCAACGTTCGGGGGGGCCGCGATGCCGTCATCC 
TCCTCACGTGCGCGATCCACCCAGAGCTAATC1TTACCATCACCAAAATCTTGCTC 
GCCATACTCGGTCCACTCATGGTGCTCCAGGCTGGTATAACCAAAGTGCCGTACT 
TCGTGCGCGCACACGGGCTCATTCGTGCATGCATGCTGGTGCGGAAGGTTGCTGG 

5 GGGTCATTATGTCCAAATGGCTCTCATGAAGTTGGCCGCACTGACAGGTACGTAC 
GTTTATGACCATCTCACCCCACTGCGGGACTGGGCCCACGCGGGCCTACGAGACC 
TTGCGGTGGCAGTTGAGCCCGTCGTC1TCTCTGATATGGAGACCAAGGTTATCAC 
CTGGGGGGCAGACACCGCGGCGTGTGGGGACATCATCTTGGGCCTGCCCGTCTCC 
GCCCGCAGGGGGAGGGAGATACATCTGGGACCGGCAGACAGCCTTGAAGGGCAG 

10 GGGTGGCGACTCCTCGCGCCTATTACGGCCTACTCCCAACAGACGCGAGGCCTAC 
TTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGGGGG 
AGGTCCAAGTGGTCTCCACCGCAACACAATCTTTCCTGGCGACCTGCGTCAATGG 
CGTGTGTTGGACTGTCTATCATGGTGCCGGCTCAAAGACCCTTGCCGGCCCAAAG 
GGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCGGCTGGCAA 

15 GCGCCCCCCGGGGCGCGTTCCTTGACACCATGCACCTGCGGCAGCTCGGACCTTT 
ACTTGGTCACGAGGCATGCCGATGTCATTCCGGTGCGCCGGCGGGGCGACAGCA 
GGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGCTCTTCGGGCGG 
TCCACTGCTCTGCCCCTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGC 
ACCCGAGGGGTTGCGAAGGCGGTGGACTTTGTACCCGTCGAGTCTATGGAAACC 

20 ACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTCCGGCCGTACCGCAGA 
CATTCCAGGTGGCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGT 
GCCGGCTGCGTATGCAGCCCAAGGGTATAAGGTGCTTGTCCTGAACCCGTCCGTC 
GCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAAGGCACATGGTATCGACCCTA 
ACATCAGAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCA 

25 CCTATGGCAAGTTTCTTGCCGACGGTGGTTGCTCTGGGGGCGCCTATGACATCAT 
AATATGTGATGAGTGCCACTCAACTGACTCGACCACTATCCTGGGCATCGGCACA 
GTCCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCT 
ACGCCTCCGGGATCGGTCACCGTGCCACATCCAAACATCGAGGAGGTGGCTCTGT 
CCAGCACTGGAGAAATCCCCTTTTATGGCAAAGCCATCCCCATCGAGACCATCAA 

30 GGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTCGCC 
GCGAAGCTGTCCGGCCTCGGACTCAATGCTGTAGCATATTACCGGGGCCTTGATG 
TATCCGTCATACCAACTAGCGGAGACGTCATTGTCGTAGCAACGGACGCTCTAAT 
GACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACATGTGTCACC 
CAGACAGTCGACTTCAGCCTGGACCCGACCTTCACCATTGAGACGACGACCGTGC 

35 CACAAGACGCGGTGTCACGCTCGCAGCGGCGAGGCAGGACTGGTAGGGGCAGGA 
TGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGGGCATGTTCGATTC 
CTCGGTTCTGTGCGAGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCC 
GCCGAGACCTCAGTTAGGTTGCGGGCTTACCTAAACACACCAGGGTTGCCCGTCT 
GCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACAGGCCTCACCCACATAGA 

40 CGCCCATTTCTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTA 
GCATACCAGGCTACGGTGTGCGCCAGGGCTCAGGCTCCACCTCCATCGTGGGACC 
AAATGTGGAAGTGTCTCATACGGCTAAAGCCTACGCTGCACGGGCCAACGCCCCT 
GCTGTATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACC 
AAATACATCATGGCATGCATGTCGGCTGACCTGGAGGTCGTCACGAGCACCTGGG 

45 TGCTGGTAGGCGGAGTCCTAGCAGCTCTGGCCGCGTATTGCCTGACAACAGGCAG 
CGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCGAC 
AGGGAAGTCCTTTACCGGGAGTTCGATGAGATGGAAGAGTGCGCCTCACACCTCC 
CTTACATCGAACAGGGAATGCAGCTCGCCGAACAATTCAAACAGAAGGCAATCG 
GGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGTGGAAT 

50 CCAAGTGGCGGACCCTCGAAGCCTTCTGGGCGAAGCATATGTGGAATTTCATCAG 
CGGGATACAATATTTAGCAGGCTTGTCCACTCTGCCTGGCAACCCCGCGATAGCA 
TCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACCCAACATACCC 
TCCTGTITAACATCCTGGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGC 
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TGCTTCTGCTTTCGTAGGCGCCGGCATCGCTGGAGCGGCTGTTGGCAGCATAGGC 
CTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAGCAGGGGTGGCAGGCG 
CGCTCGTGGCCTTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGT 
TAACCTACTCCCTGCTATCCTCTCCCCTGGCGCCCTAGTCGTCGGGGTCGTGTGCG 
5 CAGCGATACTGCGTCGGCACGTGGGCCCAGGGGAGGGGGCTGTGCAGTGGATGA 
ACCGGCTGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGT 
GCCTGAGAGCGACGCTGCAGCACGTGTCACTCAGATCCTCTCTAGTCTTACCATC 
ACTCAGCTGCTGAAGAGGCTTCACCAGTGGATCAACGAGGACTGCTCCACGCCAT 
GCTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 

10 TTTCAAGACCTGGCTCCAGTCCAAGCTCCTGCCGCGATTGCCGGGAGTCCCCTTCT 
TCTCATGTCAACGTGGGTACAAGGGAGTCTGGCGGGGCGACGGCATCATGCAAA 
CCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTCCATGA 
GGATCGTGGGGCCTAGGACCTGTAGTAACACGTGGCATGGAACATTCCCCATTAA 
CGCGTACACCACGGGCCCCTGCACGCCCTCCCCGGCGCCAAATTATTCTAGGGCG 

15 CTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTGGGGGATTTC 
CACTACGTGACGGGCATGACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGG 
CCCCCGAATTCTTCACAGAAGTGGATGGGGTGCGGTTGCACAGGTACGCTCCAGC 
GTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGGGCTCAATCAATAC 
CTGGTTGGGTCACAGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTT 

20 CCATGCTCACCGACCCCTCCCACATTACGGCGGAGACGGCTAAGCGTAGGCTGGC 
CAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCTATCCAGCTGTCTGCGCCTT 
CCTTGAAGGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGA 
GGCCAACCTCCTGTGGCGGCAGGAGATGGGCGGGAACATCACCCGCGTGGAGTC 
AGAAAATAAGGTAGTAATTTTGGACTCTTTCGAGCCGCTCCAAGCGGAGGAGGA 

25 TGAGAGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCC 
TCGAGCGATGCCCATATGGGCACGCCCGGATTACAACCCTCCACTGTTAGAGTCC 
TGGAAGGACCCGGACTACGTCCCTCCAGTGGTACACGGGTGTCCATTGCCGCCTG 
CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCGGAGCTCGCCACAAAGACCTTCGGCAGCTC 

30 CGAATCGTCGGCCGTCGACAGCGGCACGGCAACGGCCTCTCCTGACCAGCCCTCC 
GACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCCCCTTG 
AGGGGGAGCCGGGGGATCCCGATCTCAGCGACGGGTCTTGGTCTACCGTAAGCG 
AGGAGGCTAGTGAGGACGTCGTCTGCTGCTCGATGTCCTACACATGGACAGGCGC 
CCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAATGCACTGAG 

35 CAACTCTTTGCTCCGTCACCACAACTTGGTCTATGCTACAACATCTCGCAGCGCA 
AGCCTGCGGCAGAAGAAGGTCACCTTTGACAGACTGCAGGTCCTGGACGACCAC 
TACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACAGTTAAGGCTAAA 
CTTCTATCCGTGGAGGAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATCTA 
AATTTGGCTATGGGGCAAAGGACGTCCGG AACCTATCCAGCAAGGCCGTTAACC 

40 ACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGAGACACCAATTGACA 
CCACCATCATGGCAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCC 
GCAAGCCAGCTCGCCTTATCGTATTCCCAGATTTGGGGGTTCGTGTGTGCGAGAA 
AATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAGGCCGTGATGGGCTCTTCAT 
ACGGATTCCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAA 

45 AGCGAAGAAATGCCCTATGGGCTrCGCATATGACACCCGCTGTTTTGACTCAACG 
GTCACTGAGAATGACATCCGTGTTGAGGAGTCAATCTACCAATGTTGTGACTTGG 
CCCCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGG 
GCCCCCTGACTAATTCTAAAGGGCAGAACTGCGGCTATCGCCGGTGCCGCGCGAG 
CGGTGTACTGACGACCAGCTGCGGTAATACCCTCACATGTTACTTGAAGGCCGCT 

50 GCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGGAGAC 
GACCTTGTCGTTATCTGTGAAAGCGCGGGGACCCAAGAGGACGAGGCGAGCCTA 
CGGGCCTTCACGGAGGCTATGACTAGATACTCTGCCCCCCCTGGGGACCCGCCCA 
AACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTCAGTCGC 
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GCACGATGCATCTGGCAAAAGGGTGTACTATCTCACCCGTGACCCCACCACCCCC 
CTTGCGCGGGCTGCGTGGGAGACAGCTAGACACACTCCAGTCAATTCCTGGCTAG 
GCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATCCTGATGACTCA 
TTTCTTCTCCATCCTTCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGA 
5 TCTACGGGGCCTGTTACTCCATTGAGCCACTTGACCTACCTCAGATCATTCAACG 
ACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTACTCTCCAGGTGAGATCAATA 
GGGTGGCTTCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACA 
TCGGGCCAGAAGTGTCCGCGCTAGGCTACTGTCCCAGGGGGGGAGGGCTGCCAC 
TTGTGGCAAGTACCTCTTCAACTGGGCAGTAAGGACCAAGCTCAAACTCACTCCA 

10 ATCCCGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGG 
GGGAGACATATATCACAGCCTGTCTCGTGCCCGACCCCGCTGGTTCATGTGGTGC 
CTACTCCTACTTTCTGTAGGGGTAGGCATCTATCTACTCCCCAACCGATGAACGG 
GGACCTAAACACTCCAGGCCAATAGGCCATCCTGTTT^^ 
TTTTTTTTTTTTTTTT111 

15 TTCTTTCCTTTGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCC 
GTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGATCAAGT 

SEQIDNO:25: Nucleotide sequence of DNA clone of HCV adaptive replicon 5TSITR- 
EMCV/HCVrepVE 

20 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAAC 
TACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAG 
CCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTA 
CACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCT 

25 GGAGATTTGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGA 
AAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGT 
AGACCGTGCACCAGACCACAACGGTTTCCCTCTAGCGGGATCAATTCCGCCCCTC 
TCCCTCCCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGT 
GCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGC 

30 CCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCG 
CCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAG 
CTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCC 
ACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGC 
AAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGT 

35 CAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGT 
ACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTT 
AGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTITTCCTT 
TGAAAAACACGATAATACCATGGCGCCTATTACGGCCTACTCCCAACAGACGCG 
AGGCCTACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTC 

40 GAGGGGGAGGTCCAAGTGGTCTCCACCGCAACACAATCTTTCCTGGCGACCTGCG 
TCAATGGCGTGTGTTGGACTGTCTATCATGGTGCCGGCTCAAAGACCCTTGCCGG 
CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCGG 
CTGGCAAGCGCCCCCCGGGGCGCGTTCCTTGACACCATGCACCTGCGGCAGCTCG 
GACCTTTACTTGGTCACGAGGCATGCCGATGTCATTCCGGTGCGCCGGCGGGGCG 

45 ACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGCTCTTC 
GGGCGGTCCACTGCTCTGCCCCTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCC 
GTGTGCACCCGAGGGGTTGCGAAGGCGGTGGACTTTGTACCCGTCGAGTCTATGG 
AAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTCCGGCCGTACC 
GCAGACATTCCAGGTGGCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACT 

50 AAGGTGCCGGCTGCGTATGCAGCCCAAGGGTATAAGGTGCTTGTCCTGAACCCGT 
CCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAAGGCACATGGTATCGA 
CCCTAACATCAGAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTA 
CTCCACCTATGGCAAGTTTCTTGCCGACGGTGGTTGCTCTGGGGGCGCCTATGAC 
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ATCATAATATGTGATGAGTGCCACTCAACTGACTCGACCACTATCCTGGGCATCG 
GCACAGTCCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCA 
CCGCTACGCCTCCGGGATCGGTCACCGTGCCACATCCAAACATCGAGGAGGTGGC 
TCTGTCCAGCACTGGAGAAATCCCCTTTTATGGCAAAGCCATCCCCATCGAGACC 
5 ATCAAGGGGGGGAGGCACCIGATTTTCTGCCATTCCAAGAAGAAATGTGATGAG 
CTCGCCGCGAAGCTGTCCGGCCTCGGACTCAATGCTGTAGCATATTACCGGGGCC 
TTGATGTATCCGTCATACCAACTAGCGGAGACGTCATTGTCGTAGCAACGGACGC 
TCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACATGT 
GTCACCCAGACAGTCGACTTCAGCCTGGACCCGACCTTCACCATTGAGACGACGA 
10 CCGTGCCACAAGACGCGGTGTCACGCTCGCAGCGGCGAGGCAGGACTGGTAGGG 
GCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGGGCATGTT 
CGATTCCTCGGTTCTGTGCGAGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTC 
ACGCCCGCCGAGACCTCAGTTAGGTTGCGGGCTTACCTAAACACACCAGGGTTGC 
CCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACAGGCCTCACCCA 

CTGGTAGCATACCAGGCTACGGTGTGCGCCAGGGCTCAGGCTCCACCTCCATCGT 
GGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTACGCTGCACGGGCCAA 
CGCCCCTGCTGTATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCC 
CATAACCAAATACATCATGGCATGCATGTCGGCTGACCTGGAGGTCGTCACGAGC 

20 ACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCTGGCCGCGTATTGCCTGACAA 
CAGGCAGCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCAT 
TCCCGACAGGGAAGTCCTTTACCGGGAGTTCGATGAGATGGAAGAGTGCGCCTC 
ACACCTCCCTTACATCGAACAGGGAATGCAGCTCGCCGAACAATTCAAACAGAA 
GGCAATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGT 

25 GGTGGAATCCAAGTGGCGGACCCTCGAAGCCTTCTGGGCGAAGCATATGTGGAA 
TTTCATCAGCGGGATACAATATlTAGCAGGCTTGTCCACTCTGrCCTGGCAACCCC 
GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACCC 
AACATACCCTCCTGTTTAACATCCTGGGGGGATGGGTGGCCGCCCAACTTGCTCC 
TCCCAGCGCTGCTTCTGCTTTCGTAGGCGCCGGCATCGCTGGAGCGGCTGTTGGC 

30 AGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAGCAGGGG 
TGGCAGGCGCGCTCGTGGCCTTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGA 
GGACCTGGTTAACCTACTCCCTGCTATCCTCTCCCCTGGCGCCCTAGTCGTCGGGG 
TCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGGAGGGGGCTGTGC 
AGTGGATGAACCGGCTGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCAC 

35 GCACTATGTGCCTGAGAGCGACGCTGCAGCACGTGTCACTCAGATCCTCTCTAGT 
CTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGGATCAACGAGGACTGCT 
CCACGCCATGCTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGT 
GTTGACTGATTTCAAGACCTGGCTCCAGTCCAAGCTCCTGCCGCGATTGCCGGGA 
GTCCCCTTCTTCTCATGTCAACGTGGGTACAAGGGAGTCTGGCGGGGCGACGGCA 

40 TCATGCAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACG 
GTTCCATGAGGATCGTGGGGCCTAGGACCTGTAGTAACACGTGGCATGGAACATT 
CCCCATTAACGCGTACACCACGGGCCCCTGCACGCCCTCCCCGGCGCCAAATTAT 
TCTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATGACCACTGACAACGTAAAGTGCCCGTGTC 

45 AGGTTCCGGCCCCCGAATTCTTCACAGAAGTGGATGGGGTGCGGTTGCACAGGTA 
CGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGGGCTC 
AATCAATACCTGGTTGGGTCACAGCTCCCATGCGAGCCCGAACCGGACGTAGCA 
GTGCTCACTTCCATGCTCACCGACCCCTCCCACATTACGGCGGAGACGGCTAAGC 
GTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCTATCCAGCT 

50 GTCTGCGCCTTCCTTGAAGGCAACATGCACTACCCGTCATGACTCCCCGGACGCT 
GACCTCATCGAGGCCAACCTCCTGTGGCGGCAGGAGATGGGCGGGAACATCACC 
CGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCGAGCCGCTCCAAG 
CGGAGGAGGATGAGAGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCA 
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GGAAATTCCCTCGAGCGATGCCCATATGGGCACGCCCGGATTACAACCCTCCACT 
GTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGTGGTACACGGGTGTCCA 
TTGCCGCCTGCCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTG 
TCCTGTCAGAATCTACCGTGTCTTCTGCCTTGGCGGAGCTCGCCACAAAGACCTTC 
5 GGCAGCTCCGAATCGTCGGCCGTCGACAGCGGCACGGCAACGGCCTCTCCTGACC 
AGCCCTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCC 
CCCCCTTGAGGGGGAGCCGGGGGATCCCGATCTCAGCGACGGGTCTTGGTCTACC 
GTAAGCGAGGAGGCTAGTGAGGACGTCGTCTGCTGCTCGATGTCCTACACATGGA 
CAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAATG 

10 CACTGAGCAACTCTTTGCTCCGTCACCACAACTTGGTCTATGCTACAACATCTCGC 
AGCGCAAGCCTGCGGCAGAAGAAGGTCACCTTTGACAGACTGCAGGTCCTGGAC 
GACCACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACAGTTAAG 
GCTAAACTTCTATCCGTGGAGGAAGCCTGTAAGCTGACGCCCCCACATTCGGCCA 
GATCTAAATTTGGCTATGGGGCAAAGGACGTCCGGAACCTATCCAGCAAGGCCG 

15 TTAACCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGAGACACCAAT 
TGACACCACCATCATGGCAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGG 
GGGCCGCAAGCCAGCTCGCCTTATCGTATTCCCAGATTTGGGGGTTCGTGTGTGC 
GAGAAAATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAGGCCGTGATGGGCT 
CTTCATACGGATTCCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGC 

20 CTGGAAAGCGAAGAAATGCCCTATGGGCTTCGCATATGACACCCGCTGTTTTGAC 
TCAACGGTCACTGAGAATGACATCCGTGTTGAGGAGTCAATCTACCAATGTTGTG 
ACTTGGCCCCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACAT 
CGGGGGCCCCCTGACTAATTCTAAAGGGCAGAACTGCGGCTATCGCCGGTGCCGC 
GCGAGCGGTGTACTGACGACCAGCTGCGGTAATACCCTCACATGTTACTTGAAGG 

25 CCGCTGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGG 
AGACGACCTTGTCGTTATCTGTGAAAGCGCGGGGACCCAAGAGGACGAGGCGAG 
CCTACGGGCCTTCACGGAGGCTATGACTAGATACTCTGCCCCCCCTGGGGACCCG 
CCCAAACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTCAG 
TCGCGCACGATGCATCTGGCAAAAGGGTGTACTATCTCACCCGTGACCCCACCAC 

30 CCCCCTTGCGCGGGCTGCGTGGGAGACAGCTAGACACACTCCAGTCAATTCCTGG 
CTAGGCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATCCTGATGA 
CTCATTTCTTCTCCATCCTTCTAGCTCAGGAAC^CnTGAAAAAGCCCTAGATTGT 
CAGATCTACGGGGCCTGTTACTCCATTGAGCCACTTGACCTACCTCAGATCATTC 
AACGACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTACTCTCCAGGTGAGATC 

35 AATAGGGTGGCTTCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGA 
GACATCGGGCCAGAAGTGTCCGCGCTAGGCTACTGTCCCAGGGGGGGAGGGCTG 
CCACTTGTGGCAAGTACCTCTTCAACTGGGCAGTAAGGACCAAGCTCAAACTCAC 
TCCAATCCCGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACA 
GCGGGGGAGACATATATCACAGCCTGTCTCGTGCCCGACCCCGCTGGTTCATGTG 

40 GTGCCTACTCCTACTTTCTGTAGGGGTAGGCATCTATCTACTCCCCAACCGATGAA 
CGGGGACCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTlTrCCCTTTT^ 
TTCTTTTTTTTTTTTTIT^^ 

CCTTTTCTTTCCTTTGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAG 
GTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGATCAA 
45 GT 
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What is claimed is: 

1 . A polynucleotide comprising a non-naturally occurring HCV sequence that is 
capable of productive replication in a host cell, or is capable of being transcribed into a non- 
naturally occurring HCV sequence that is capable of productive replication in a host cell, 
wherein the HCV sequence comprises, from 5' to 3' on the positive-sense nucleic acid, a 

5 functional 5' non-translated region (5* NTR); one or more protein coding regions, including at 
least one polyprotein coding region that is capable of replicating HCV RNA; and a functional 
HCV 3' non-translated region (3' NTR). 

2. The polynucleotide of claim 1, further comprising an adaptive mutation. 

3. The polynucleotide of claim 2, having a transfection efficiency into mammalian 
cells of greater than 0.01%. 

4. The polynucleotide of claim 3, wherein the transfection efficiency into mammalian 
cells is greater than 0.1%. 

5. The polynucleotide of claim 3, wherein the transfection efficiency into mammalian 
cells is greater than 1%. 

6. The polynucleotide of claim 3, wherein the transfection efficiency into mammalian 
cells is greater than 5%. 

7. The polynucleotide of claim 2, wherein the polynucleotide is capable of 
replication in a non-hepatic cell. 

8. The polynucleotide of claim 7, wherein the non-hepatic cell is a HeLa cell. 

9. The polynucleotide of claim 2, wherein the HCV is impaired in its ability to cause 
disease, establish chronic infections, trigger autoimmune responses, and transform cells. 

10. The polynucleotide of claim 2, wherein the polyprotein region comprises an 
NS5A gene that is not a wild-type NS5A gene. 

1 1 . The polynucleotide of claim 10, wherein the NS5 A gene comprises a mutation. 
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12. The polynucleotide of claim 1 1, wherein the mutation is within 50 nucleotides of 
an ISDR or includes fee ISDR. 

13. The polynucleotide of claim 12, wherein the mutation is within 20 nt of the 
ISDR, or includes the ISDR. 

14. The polynucleotide of claim 13, wherein the mutation encodes an amino acid 
sequence change selected from the group consisting of Ser (1 179) to lie, Arg (1 164) to Gly, 
Ala(l 174) to Ser, Ser(l 172) to Cys, and Ser(l 172) to Pro of SEQ ID NO:3 . 

15. The polynucleotide of claim 11, wherein the mutation comprises a deletion of at 
least a portion of the ISDR. 

16. The polynucleotide of claim 15, wherein the mutation comprises a deletion of the 
entire ISDR. 

17. The polynucleotide of claim 16, wherein the mutation comprises a deletion of 
nucleotides corresponding to nucleotides 5345 to 5485 of SEQ ID NO:6. 

18. The polynucleotide of claim 1, wherein the polynucleotide comprises at least one 
IRES selected from the group consisting of a viral IRES, a cellular IRES, and an artificial 
IRES. 

19. The polynucleotide of claim 18, wherein the HCV polyprotein coding region 
encodes all HCV structural and nonstructural proteins. 

20. The polynucleotide of claim 19, farmer-comprising a foreign gene operably 
linked to a first IRES and the HCV polyprotein coding region operably linked to a second 
IRES. 

21. The polynucleotide of claim 18, wherein the polyprotein coding region is 
incapable of making infectious HCV particles. 
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22. The polynucleotide of claim 21, wherein the polyprotein coding region comprises 
a mutation and/or a deletion in the structural protein coding region. 

23. The polynucleotide of claim 22, further comprising a foreign gene operably 
linked to a first IRES and the HCV polyprotein coding region operably linked to a second 
IRES. 

24. The polynucleotide of claim 23, wherein the foreign gene is a gene encoding a 
selectable marker or a reporter gene. 

25. The polynucleotide of claim 24, further comprising an adaptive mutation. 

26. The polynucleotide of claim 25, having a transfection efficiency into mammalian 
cells of greater than 0.01%. 

27. The polynucleotide of claim 26, wherein the transfection efficiency into 
mammalian cells is greater than 1%. 

28. The polynucleotide of claim 26, wherein the transfection efficiency into 
mammalian cells is greater than 5%. 

29. The polynucleotide of claim 26, wherein the transfection efficiency into 
mammalian cells is about 6%. 

30. The polynucleotide of claim 25, wherein the polynucleotide is capable of 
replication in a non-hepatic cell. 

3 1 . The polynucleotide of claim 30, wherein the non-hepatic cell is a HeLa cell. 

32. The polynucleotide of claim 25, wherein the HCV is impaired in its ability to 
cause disease, establish chronic infections, trigger autoimmune responses, and transform cells. 

33 . The polynucleotide of claim 25, wherein the polyprotein region comprises an 
NS5A gene that is not a wild-type NS5A gene. 
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34. The polynucleotide of claim 33, wherein the NS5A gene comprises a mutation. 

35. The polynucleotide of claim 34, wherein the mutation is within 50 nucleotides of 
an ISDR or includes the ISDR. 

36. The polynucleotide of claim 34, wherein the mutation is within 20 nt of the 
ISDR, or includes the ISDR. 

37. The polynucleotide of claim 36, wherein the mutation encodes an amino acid 
sequence change selected from the group consisting of Ser (1 179) to lie, Arg (1 164) to Gly, 
Ala(l 174) to Ser, Ser(l 172) to Cys, and Ser(l 172) to Pro of SEQ ID NO:3. 

38. The polynucleotide of claim 34, wherein the mutation comprises a deletion of at 
least a portion of the ISDR. 

39. The polynucleotide of claim 38, wherein the mutation comprises a deletion of the 
entire ISDR. 

40. The polynucleotide of claim 39, wherein the mutation comprises a deletion of 
nucleotides corresponding to nucleotides 5345 to 5485 of SEQ ID NO:6. 

41. The polynucleotide of claim 24, wherein: 

(a) the first IRES is an HCV IRES; 

(b) the foreign gene is a neo gene; and 

(c) tibe second IRES is a EMCV IRES. 

42. The polynucleotide of claim 41, wherein the HCV sequence is a genotype 1 HCV 
sequence. 

43. The polynucleotide of claim 42, wherein the HCV sequence is subtype lb. 

44. The polynucleotide of claim 41, comprising SEQ ID NO:5 or SEQ ID NO:6. 



45. The polynucleotide of claim 41, further comprising an adaptive mutation. 
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46. The polynucleotide of claim 45, having a transfection efficiency into mammalian 
cells of greater than 0.01%. 

47. The polynucleotide of claim 46, wherein the transfection efficiency into 
mammalian cells is greater than 1%. 

48. The polynucleotide of claim 46, wherein the transfection efficiency into 
mammalian cells is greater than 5%. 

49. The polynucleotide of claim 46, wherein the transfection efficiency into 
mammalian cells is about 6%. 

50. The polynucleotide of claim 45, wherein the polynucleotide is capable of 
replication in a non-hepatic cell. 

51. The polynucleotide of claim 50, wherein the non-hepatic cell is a HeLa cell. 

52. The polynucleotide of claim 45, wherein the HCV is impaired in its ability to 
cause disease, establish chronic infections, trigger autoimmune responses, and transform cells. 

53. The polynucleotide of claim 45, wherein the polyprotein region comprises an 
NS5 A gene that is not a wild-type NS5A gene. 

54. The polynucleotide of claim 53, wherein the NS5 A gene comprises a mutation. 

55. The polynucleotide of claim 54, wherein the mutation is within 50 nucleotides of 
an ISDR or includes the ISDR 

56. The polynucleotide of claim 54, wherein the mutation is within 20 nt of the 
ISDR, or includes the ISDR. 

57. The polynucleotide of claim 56, wherein the mutation encodes an amino acid 
sequence change selected from the group consisting of Ser (1 179) to lie, Arg (1 164) to Gly, 
Ala(l 174) to Ser, Ser(l 172) to Cys, and Ser(l 172) to Pro of SEQ ID NO:3. 
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58. The polynucleotide of claim 54, wherein the mutation comprises a deletion of at 
least a portion of the ISDR. 

59. The polynucleotide of claim 58, wherein the mutation comprises a deletion of the 
entire ISDR. 

60. The polynucleotide of claim 59, wherein the mutation comprises a deletion of 
nucleotides corresponding to nucleotides 5345 to 5485 of SEQ ED NO:6. 

61. The polynucleotide of claim 1, wherein the polynucleotide is double-stranded 

DNA. 

62. A vector comprising the polynucleotide of claim 61 operably associated with a 
promoter. 

63. The polynucleotide of claim 41 wherein the polynucleotide is double-stranded 

DNA. 

64. A vector comprising the polynucleotide of claim 63 operably associated with a 
promoter. 

65. The vector of claim 64, further comprising a mutation in the NS5A gene. 

66. The vector of claim 65, wherein the mutation is selected from the group 
consisting of mutations encoding the amino acid changes Ser (1 179) to He, Arg (1 164) to Gly, 
Ala(l 174) to Ser, Ser(1172) to Cys, and Ser(l 172) to Pro of SEQ ID NO:3; and an in frame 
deletion of nucleotides encoding amino acids comprising at least a portion of the ISDR. 

67. The vector of claim 66, wherein the mutation comprises a deletion of the entire 

ISDR. 

68. The vector of claim 67, wherein the mutation comprises a deletion of nucleotides 
corresponding to nucleotides 5345 to 5485 of SEQ ID NO:6. 

69. A cell comprising the vector of claim 62. 
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70. A host cell comprising the polynucleotide of claim 2, wherein the host cell is a 
mammalian cell. 

71. The host cell of claim 70, wherein the polynucleotide comprises an adaptive 
mutation. 

72. The host cell of claim 71 wherein the host cell is a human cell. 

73. The host cell of claim 72 wherein the host cell is a liver cell. 

74. The host cell of claim 72 wherein Hie host cell is a T-cell or a B-cell. 

75. The host cell of claim 72 wherein the host cell is a HeLa cell. 

76. A method for identifying a cell line that is permissive for infection with HCV, 
comprising contacting a cell in tissue culture with an infectious amount of the polynucleotide 
of claim 1 , and detecting replication of HCV in cells of the cell line. 

77. A method for producing a cell line comprising replicating HCV, the method 
comprising , 

(a) transcribing the vector of claim 62 to synthesize HCV RNA; 

(b) transfecting a cell with the HCV RNA of step (a); and 
5 (c) culturing the cell. 

78. A vaccine comprising Hie polynucleotide of claim 1 in a pharmaceutically 
acceptable carrier. 

79. The vaccine of claim 78, wherein the polynucleotide further comprises an 
adaptive mutation. 

80. The vaccine of claim 79, wherein the adaptive mutation comprises a deletion of 
nucleotides corresponding to nucleotides 5345 to 5485 of SEQ ID NO:6. 
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' 8 1 . The vaccine of claim 80, wherein the HCV is impaired in its ability to cause 
disease, establish chronic infections, trigger autoimmune responses, and transform cells. 

82. A method of inducing irnmunoprotection to HCV in a primate, comprising 
administering to the primate the vaccine of claim 78. 

83. A method of inducing irnmunoprotection to HCV in a primate, comprising 
admmistering to the primate the vaccine of claim 8 1 . 

84. A method of testing a compound for inhibiting HCV replication, comprising 

(a) treating the host cell of claim 70 with the compound; 

(b) evaluating the treated host cell for reduced HCV replication, wherein reduced 
HCV replication indicates the ability of the compound to inhibit HCV replication. 

85. A method of testing a compound for intabiting HCV infection comprising 
treating a host cell with the compound before, during or after infecting or transfecting the host 
cell with the polynucleotide of claim 1 . 
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Figure 2 



WO 01/89364 



4/11 



PCT/US01/16822 




WO 01/89364 



5/11 



PCT/US01/16822 



Avail 

■ Ammo Jj| H „ EE 5 I -^L /oo^J 3* 

GDD->AAG 
pol- 

DNase digest RNA transcripts 
Eiectroporate RNA into Huh7 cells 
G418-resistant colonies were generated at low frequency 
28 colonies were picked & 90% of these could be passaged 
• No colonies observed for the replicon RNA containing an inactive RDRP 



Clone 


Copy number/cell 


Cytoplasmic NS3 


Growth Rate 


I 


>1000 


Yes 


Fast 


n 


-1000-5000 


Yes 


Fast 


IV 


ND 


Yes 


Fast 


V 


500 


ND 


Moderate 


VI 


-1000 


Yes 


Fast 


vn 


>800 


Yes 


Fast 


Clone E 


<400 


No 


Very slow 



Figure 4 



WO 01/89364 



6/11 



PCT/US01/16822 




Figure 5 



WO 01/89364 



7/11 



PCTYUS01/16822 




V© 



WO 01/89364 



8/11 



PCT/US01/16822 



I St 



sS V : : : : : 



Si 



. u 

OK V 

Si 

Jig 



si 

~ o 



6 



51 

S . . . 



h S 



WO 01/89364 



9/11 



PCT/US01/16822 




WO 01/89364 



11/11 



PCT/US01/16822 




Figure 10 



WO 01/89364 



1 



PCT/US01/16822 



SEQUENCE LISTING 

<110> Rice III, Charles 
Blight, Keril 

<120> HCV Variants 

<130> 6029-7868 

<140> 
<141> 

<150> 09/576,989 
<151> 2000-05-23 

<150> 09/034,756 
<151> 1998-03-04 

<160> 24 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 21 
<212> DNA 

<213> Hepatitis C virus 
<400> 1 

ggcgacactc caccatagat c 21 

<210> 2 
<211> 99 
<212> DNA 

<213> Hepatitis C virus 
<400> 2 

tggtggctcc atcttagccc tagtcacggc tagctgtgaa aggtccgtga gccgcatgac 60 
tgcagagagt gctgatactg gcctctctgc tgatcatgt 99 

<210> 3 v 
<211> 1985 
<212> PRT 

<213> Hepatitis C virus 
<400> 3 

Met Ala Pro lie Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
1 5 10 15 

Cys He He Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 
35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 
50 55 60 

Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu Thr 
85 90 " 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 
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Asp Val He Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
130 135 140 

Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
145 150 155 160 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 
165 170 175 

Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro 
180 185 190 

Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr Gly 
195 200 205 

Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr 
210 215 220 

Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
225 230 235 240 

Ala Tyr Met Ser Lys Ala His Gly He Asp Pro Asn He Arg Thr Gly 
245 250 255 

Val Arg Thr He Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr Gly 
260 265 "~ 270 

Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He 
275 280 285 

He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly He 
290 ' 295 300 

Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val 
305 310 315 320 

Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 

325 • 330 335 

He Glu Glu Val Ala Leu Ser Ser Thr Gly Glu He Pro Phe Tyr Gly 
340 345 350 

Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He Phe 
355 360 365 

Cys His Ser Lys Lys Lys Cys Asp Glu' Leu Ala Ala Lys Leu Ser Gly 
370 375 380 

Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val 
385 390 395 400 

He Pro Thr Ser Gly Asp Val He Val Val Ala Thr Asp Ala Leu Met 
405 410 415 



Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu 
435 440 ~ 445 

Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
450 455 460 
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Arg Thr Gly Arg Gly Arg Met Gly lie Tyr Arg Phe Val Thr Pro Gly 
465 470 475 480 

Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr 
485 490 495 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser Val 
500 505 510 

Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
515 520 525 

His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His lie Asp 
530 535 540 

Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
545 550 555 560 

Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
565 570 575 

Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie Arg Leu Lys Pro Thr 
580 585 590 



Glu Val Thr Thr Thr His Pro lie Thr Lys Tyr He Met Ala Cys Met 
610 615 . 620 

Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly 
625 630 635 640 

Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val 
645 650 655 

He Val Gly Arg He He Leu Ser Gly Lys Pro Ala He He Pro Asp 
660 665 670 

Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala Ser 
675 680 685 

His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys 
690 695 700 

Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin 'Ala Glu Ala 
705 710 715 720 

Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe Trp 
725 730 735 

Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
740 745 750 

Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala Phe 
755 760 7.65 

Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu Phe 
770 775 780 

Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala 
785 790 795 800 

Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser 
805 810 815 
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lie Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
820 825 830 

Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met 
835 840 845 

Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro 
850 855 860 

Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 
865 870 875 880 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
885 890 895 

Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu 
900 905 910 

Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr He 
915 920 925 

Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser 
930 ~ 935 * 940 

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys 
945 950 955 960 

Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro 
965 970 975 



Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala 

995 1000 1005 

Gin He Thr Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro 

1010 1015 1020 

Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr 

1025 1030 1035 1040 



Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro 
1075 1080 1085 

Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg 
1090 1095 1100 

Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val 
1105 1110 1115 1120 

Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro 
1125 1130 1135 

Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp 
1140 1145 1150 

Pro Ser His lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly 
1155 1160 1165 
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Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 
1170 1175 1180 

Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp 
1185 1190 1195 1200 

Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn lie 
1205 1210 1215 

Thr Arg Val Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Glu 
1220 1225 1230 

Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu 
1235 1240 1245 

lie Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro lie Trp Ala 
1250 1255 1260 

Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp 
1265 1270 1275 1280 

Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala 
1285 1290 1295 

Pro Pro lie Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu 
1300 1305 1310 

Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly 
1315 1320 1325 

Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro 
1330 1335 1340 

Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr 
1345 1350 1355 1360 

Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser 
1365 1370 1375 

Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val 
1380 1385 1390 

Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys 
1395 1400 1405 

Ala Ala Glu Glu Thr Lys Leu Pro He Asn Ala Leu Ser' Asn Ser Leu 
1410 1415 1420 

Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser 
1425 1430 1435 1440 

Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp 
1445 1450 1455 

His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val 
1460 1465 1470 

Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro 
1475 1480 1485 

His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn 
1490 1495 " 1500 

Leu Ser Ser Lys Ala Val Asn His He Arg Ser Val Trp Lys Asp Leu 
1505 1510 1515 1520 
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Leu Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn 
1525 1530 1535 

Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg 
1540 1545 1550 

Leu lie Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala 
1555 1560 1565 

Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser 
1570 1575 1580 

Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn 
1585 1590 1595 1600 

Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg 
1605 1610 1615 

Cys Phe Asp Ser Thr Val Thr Glu Asn Asp. He Arg Val Glu Glu Ser 
1620 1625 1630 

He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg 
1635 ~ 1640 1645 

Ser Leu Thr Glu Arg Leu Tyr He . Gly Gly Pro Leu Thr Asn Ser Lys 
1650 1655 1660 

Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 
1665 1670 1675 1680 

Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala 
1685 1690 1695 



Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp 
1700 1705 1710 

Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu Ala 
1715 1720 1725 



Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro 
1730 1735 1740 

Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys 
1745 1750 1755 1760 

Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr 
1765 1770 1775 

Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu 
1780 1785 1790 

Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met 
1795 1800 1805 

Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe 
1810 1815 1820 

Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin 
1825 1830 1835 1840 

He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He 
1845 1850 1855 

He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser 
1860 1865 1870 



32662.doc 



WO 01/89364 



7 



PCT/US01/16822 



Pro Gly Glu lie Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val 
1875 1880 1885 

Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg 
1890 1895 1900 

Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe 
1905 1910 1915 1920 

Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro lie Pro Ala Ala 
1925 1930 1935 

Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly Gly 
1940 1945 1950 

Asp lie Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp 
1955 1960 1965 

Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn 
1970 1975 1980 



Arg 
1985 



<210> 4 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
1 5 ,10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 



Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
50 ~ 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70. 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 , 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 
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Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu lie Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu lie Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro lie Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

' Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 " 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 



Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 5 
<211> 7987 
<212> DNA 

<213> Hepatitis C virus 
<400> 5 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaggg cgcgccatga 
cggccgcttg ggtggagagg ctattcggct 
ctgatgccgc cgtgttccgg ctgtcagcgc 
acctgtccgg tgccctgaat gaactgcagg 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
ttgaacaaga tggattgcac gcaggttctc 420 
atgactgggc acaacagaca atcggctgct 480 
aggggcgccc ggttcttttt gtcaagaccg 540 
acgaggcagc gcggctatcg tggctggcca 600 
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cgacgggcgt tccttgcgca gctgtgctcg 
tgctattggg cgaagtgccg gggcaggatc 
aagtatccat catggctgat gcaatgcggc 
cattcgacca ccaagcgaaa catcgcatcg 
ttgtcgatca ggatgatctg gacgaagagc 
ccaggctcaa ggcgcgcatg cccgacggcg 
gcttgccgaa tatcatggtg gaaaatggcc 
tgggtgtggc ggaccgctat caggacatag 
ttggcggcga atgggctgac cgcttcctcg 
agcgcatcgc cttctatcgc cttcttgacg 
gtttccctct agcgggatca attccgcccc 
cgaagccgct tggaataagg ccggtgtgcg 
ccgtcttttg gcaatgtgag ggcccggaaa 
aggggtcttt cccctctcgc caaaggaatg 
gttcctctgg aagcttcttg aagacaaaca 
aaccccccac ctggcgacag gtgcctctgc 
gcaaaggcgg cacaacccca gtgccacgtt 
tggctctcct caagcgtatt caacaagggg 
atgggatctg atctggggcc tcggtgcaca 
aacgtctagg ccccccgaac cacggggacg 
atggcgccta ttacggccta ctcccaacag 
agcctcacag gccgggacag gaaccaggtc 
acacaatctt tcctggcgac ctgcgtcaat 
ggctcaaaga cccttgccgg cccaaagggc 
caggacctcg tcggctggca agcgcccccc 
ggcagctcgg acctttactt ggtcacgagg 
ggcgacagca gggggagcct actctccccc 
ggcggtccac tgctctgccc ctcggggcac 
acccgagggg ttgcgaaggc ggtggacttt 
cggtccccgg tcttcacgga caactcgtcc 
gcccatctac acgcccctac tggtagcggc 
gcccaagggt ataaggtgct tgtcctgaac 
gcgtatatgt ctaaggcaca tggtatcgac 
accacgggtg cccccatcac gtactccacc 
tctgggggcg cctatgacat cataatatgt 
atcctgggca tcggcacagt cctggaccaa 
ctcgccaccg ctacgcctcc gggatcggtc 
gctctgtcca gcactggaga aatccccttt 
aaggggggga ggcacctcat tttctgccat 
' aagctgtccg gcctcggact caatgctgta 
ataccaacta gcggagacgt cattgtcgta 
ggcgatttcg actcagtgat cgactgcaat 
ctggacccga ccttcaccat tgagacgacg 
cagcggcgag gcaggactgg taggggcagg 
gaacggccct cgggcatgtt cgattcctcg 
gcttggtacg agctcacgcc cgccgagacc 
ccagggttgc ccgtctgcca ggaccatctg 
acccacatag acgcccattt cttgtcccag 
ctggtagcat accaggctac ggtgtgcgcc 
caaatgtgga agtgtctcat acggctaaag 
tataggctgg gagccgttca aaacgaggtt 
atggcatgca tgtcggctga cctggaggtc 
gtcctagcag ctctggccgc gtattgcctg 
atcatcttgt ccggaaagcc ggccatcatt 
gatgagatgg aagagtgcgc ctcacacctc 
gaacaattca aacagaaggc aatcgggttg 
gctgctcccg tggtggaatc caagtggcgg 
tggaatttca tcagcgggat acaatattta 
gcgatagcat cactgatggc attcacagcc 
accctcctgt ttaacatcct ggggggatgg 
gcttctgctt tcgtaggcgc cggcatcgct 
aaggtgcttg tggatatttt ggcaggttat 
tttaaggtca tgagcggcga gatgccctcc 
atcctctccc ctggcgccct agtcgtcggg 
gtgggcccag gggagggggc tgtgcagtgg 
ggtaaccacg tctcccccac gcactatgtg 



acgttgtcac tgaagcggga agggactggc 660 
tcctgtcatc tcaccttgct cctgccgaga 720 
ggctgcatac gcttgatccg gctacctgcc 780 
agcgagcacg tactcggatg gaagccggtc 840 
atcaggggct cgcgccagcc gaactgttcg 900 
aggatctcgt cgtgacccat ggcgatgcct 960 
gcttttctgg attcatcgac tgtggccggc 1020 
cgttggctac ccgtgatatt gctgaagagc 1080 
tgctttacgg tatcgccgct cccgattcgc 1140 
agttcttctg agtttaaaca gaccacaacg 1200 
tctccctccc ccccccctaa cgttactggc 1260 
tttgtctata tgttattttc caccatattg 1320 
cctggccctg tcttcttgac gagcattcct 1380 
caaggtctgt tgaatgtcgt gaaggaagca 1440 
acgtctgtag cgaccctttg caggcagcgg 1500 
ggccaaaagc cacgtgtata agatacacct 1560 
gtgagttgga tagttgtgga aagagtcaaa 1620 
ctgaaggatg cccagaaggt accccattgt 1680 
tgctttacat gtgtttagtc gaggttaaaa 1740 
tggttttcct ttgaaaaaca cgataatacc 1800 
acgcgaggcc tacttggctg catcatcact 1860 
gagggggagg tccaagtggt ctccaccgca 1920 
ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ccaatcaccc aaatgtacac caatgtggac 2040 
ggggcgcgtt ccttgacacc atgcacctgc 2100 
catgccgatg tcattccggt gcgccggcgg 2160 
aggcccgtct cctacttgaa gggctcttcg 2220 
gctgtgggca tctttcgggc tgccgtgtgc 2280 
gtacccgtcg agtctatgga aaccactatg 2340 
cctccggccg taccgcagac attccaggtg 2400 
aagagcacta aggtgccggc tgcgtatgca 2460 
ccgtccgtcg ccgccaccct aggtttcggg 2520 
cctaacatca gaaccggggt aaggaccatc 2580 
tatggcaagt ttcttgccga cggtggttgc 2640 
gatgagtgcc actcaactga ctcgaccact 2700 
gcggagacgg ctggagcgcg actcgtcgtg 2760 
accgtgccac atccaaacat cgaggaggtg 2820 
tatggcaaag ccatccccat cgagaccatc 2880 
tccaagaaga aatgtgatga gctcgccgcg 2940 
gcatattacc ggggccttga tgtatccgtc 3000 
gcaacggacg ctctaatgac gggctttacc 3060 
acatgtgtca cccagacagt cgacttcagc 3120 
accgtgccac aagacgcggt gtcacgctcg 3180 
atgggcattt acaggtttgt gactccagga 3240 
gttctgtgcg agtgctatga cgcgggctgt 3300 
tcagttaggt tgcgggctta cctaaacaca 3360 
gagttctggg agagcgtctt tacaggcctc 3420 
actaagcagg caggagacaa cttcccctac 3480 
agggctcagg ctccacctcc atcgtgggac 3540 
cctacgctgc acgggccaac gcccctgctg 3600 
actaccacac accccataac caaatacatc 3660 
gtcacgagca cctgggtgct ggtaggcgga 3720 
acaacaggca gcgtggtcat tgtgggcagg 3780 
cccgacaggg aagtccttta ccgggagttc 3840 
ccttacatcg aacagggaat gcagctcgcc 3900 
ctgcaaacag ccaccaagca agcggaggct 3960 
accctcgaag ccttctgggc gaagcatatg 4020 
gcaggcttgt ccactctgcc tggcaacccc 4080 
tctatcacca gcccgctcac cacccaacat 4140 
gtggccgccc aacttgctcc tcccagcgct 4200 
ggagcggctg ttggcagcat aggccttggg 4260 
ggagcagggg tggcaggcgc gctcgtggcc 4320 
accgaggacc tggttaacct actccctgct 4380 
gtcgtgtgcg cagcgatact gcgtcggcac 4440 
atgaaccggc tgatagcgtt cgcttcgcgg 4500 
cctgagagcg acgctgcagc acgtgtcact 4560 
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cagatcctct ctagtcttac catcactcag 
gaggactgct ccacgccatg ctccggctcg 
acggtgttga ctgatttcaa gacctggctc 
gtccccttct tctcatgtca acgtgggtac 
caaaccacct gcccatgtgg agcacagatc 
atcgtggggc ctaggacctg tagtaacacg 
accacgggcc cctgcacgcc ctccccggcg 
gctgctgagg agtacgtgga ggttacgcgg 
accactgaca acgtaaagtg cccgtgtcag 
gatggggtgc ggttgcacag gtacgctcca 
acattcctgg tcgggctcaa tcaatacctg 
ccggacgtag cagtgctcac ttccatgctc 
gctaagcgta ggctggccag gggatctccc 
ctgtctgcgc cttccttgaa ggcaacatgc 
ctcatcgagg ccaacctcct gtggcggcag 
tcagaaaata aggtagtaat tttggactct 
agggaagtat ccgttccggc ggagatcctg 
cccatatggg cacgcccgga ttacaaccct 
tacgtccctc cagtggtaca cgggtgtcca 
cctccacgga ggaagaggac ggttgtcctg 
gagctcgcca caaagacctt cggcagctcc 
acggcctctc ctgaccagcc ctccgacgac 
tcctccatgc ccccccttga gggggagccg 
tctaccgtaa gcgaggaggc tagtgaggac 
acaggcgccc tgatcacgcc atgcgctgcg 
agcaactctt tgctccgtca ccacaacttg 
ctgcggcaga agaaggtcac ctttgacaga 
gtgctcaagg agatgaaggc gaaggcgtcc 
gaagcctgta agctgacgcc cccacattcg 
gacgtccgga acctatccag caaggccgtt 
ctggaagaca ctgagacacc aattgacacc 
gtccaaccag agaagggggg ccgcaagcca 
gttcgtgtgt gcgagaaaat ggccctttac 
atgggctctt catacggatt ccaatactct 
gcctggaaag cgaagaaatg ccctatgggc 
acggtcactg agaatgacat ccgtgttgag 
cccgaagcca gacaggccat aaggtcgctc 
actaattcta aagggcagaa ctgcggctat 
accagctgcg gtaataccct cacatgttac 
aagctccagg actgcacgat gctcgtatgc 
gcggggaccc aagaggacga ggcgagccta 
tctgcccccc ctggggaccc gcccaaacca 
tcctccaatg tgtcagtcgc gcacgatgca 
gaccccacca ccccccttgc gcgggctgcg 
tcctggctag gcaacatcat catgtatgcg 
actcatttct tctccatcct tctagctcag 
atctacgggg cctgttactc cattgagcca 
catggcctta gcgcattttc actccatagt 
tcatgcctca ggaaacttgg ggtaccgccc 
gtccgcgcta ggctactgtc ccaggggggg 
aactgggcag taaggaccaa gctcaaactc 
ttatccagct ggttcgttgc tggttacagc 
gcccgacccc gctggttcat gtggtgccta 
ctactcccca accgatgaac ggggagctaa 
tttccctttt tttttttctt tttttttttt 
tttcctcttt ttttcctttt ctttcctttg 
gctgtgaaag gtccgtgagc cgcttgactg 
atcaagt 



ctgctgaaga ggcttcacca gtggatcaac 4 620 
tggctaagag atgtttggga ttggatatgc 4680 
cagtccaagc tcctgccgcg attgccggga 4740 
aagggagtct ggcggggcga cggcatcatg 4800 
accggacatg tgaaaaacgg ttccatgagg 4860 
tggcatggaa cattccccat taacgcgtac 4920 
ccaaattatt ctagggcgct gtggcgggtg 4980 
gtgggggatt tccactacgt gacgggcatg 5040 
gttccggccc ccgaattctt cacagaagtg 5100 
gcgtgcaaac ccctcctacg ggaggaggtc 5160 
gttgggtcac agctcccatg cgagcccgaa 5220 
accgacccct cccacattac ggcggagacg 5280 
ccctccttgg ccagctcatc agctagccag 5340 
actacccgtc atgactcccc ggacgctgac 5400 
gagatgggcg ggaacatcac ccgcgtggag 5460 
ttcgagccgc tccaagcgga ggaggatgag 5520 
cggaggtcca ggaaattccc tcgagcgatg 5580 
ccactgttag agtcctggaa ggacccggac 5640 
ttgccgcctg ccaaggcccc tccgatacca 5700 
tcagaatcta ccgtgtcttc tgccttggcg 5760 
gaatcgtcgg ccgtcgacag cggcacggca 5820 
ggcgacgcgg gatccgacgt tgagtcgtac 5880 
ggggatcccg atctcagcga cgggtcttgg 5940 
gtcgtctgct gctcgatgtc ctacacatgg 6000 
gaggaaacca agctgcccat caatgcactg 6060 
gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcaggtcc tggacgacca ctaccgggac 6180 
acagttaagg ctaaacttct atccgtggag 6240 
gccagatcta aatttggcta tggggcaaag 6300 
aaccacatcc gctccgtgtg gaaggacttg 6360 
accatcatgg caaaaaatga ggttfctctgc 6420 
gctcgcctta tcgtattccc agatttgggg 6480 
gatgtggtct ccaccctccc tcaggccgtg 6540 
cctggacagc gggtcgagtt cctggtgaat 6600 
ttcgcatatg acacccgctg ttttgactca 6660 
gagtcaatct accaatgttg tgacttggcc 6720 
acagagcggc tttacatcgg gggccccctg 6780 
cgccggtgcc gcgcgagcgg tgtactgacg 6840 
ttgaaggccg ctgcggcctg tcgagctgcg 6900 
ggagacgacc ttgtcgttat ctgtgaaagc 6960 
cgggccttca cggaggctat gactagatac 7020 
gaatacgact tggagttgat aacatcatgc 7080 
tctggcaaaa gggtgtacta tctcacccgt 7140 
tgggagacag ctagacacac tccagtcaat 7200 
cccaccttgt gggcaaggat gatcctgatg 7260 
gaacaacttg aaaaagccct agattgtcag 7320 
cttgacctac ctcagatcat tcaacgactc 7380 
tactctccag gtgagatcaa tagggtggct 7440 
ttgcgagtct ggagacatcg ggccagaagt 7500 
agggctgcca cttgtggcaa gtacctcttc 7560 
actccaatcc cggctgcgtc ccagttggat 7620 
gggggagaca tatatcacag cctgtctcgt 7680 
ctcctacttt ctgtaggggt aggcatctat 7740 
acactccagg ccaataggcc atcctgtttt 7800 
tttttttttt tttttttttt ctcctttttt 7860 
gtggctccat cttagcccta gtcacggcta 7920 
cagagagtgc tgatactggc ctctctgcag 7980 
7987 



<210> 6 
<211> 7989 
<212> DNA 

<213> Hepatitis C virus 



<400> 6 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
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tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg -acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctocccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ' ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
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tggaatttca tcagcgggat acaatattta 
gcgatagcat cactgatggc attcacagcc 
accctcctgt ttaacatcct ggggggatgg 
gcttctgctt tcgtaggcgc cggcatcgct 
aaggtgcttg tggatatttt ggcaggttat 
tttaaggtca tgagcggcga gatgccctcc 
atcctctccc ctggcgccct agtcgtcggg 
gtgggcccag gggagggggc tgtgcagtgg 
ggtaaccacg tctcccccac gcactatgtg 
cagatcctct ctagtcttac catcactcag 
gaggactgct ccacgccatg ctccggctcg 
aoggtgttga ctgatttcaa gacctggctc 
gtccccttct tctcatgtca acgtgggtac 
caaaccacct gcccatgtgg agcacagatc 
atcgtggggc ctaggacctg tagtaacacg 
accacgggcc cctgcacgcc ctccccggcg 
gctgctgagg agtacgtgga ggttacgcgg 
accactgaca acgtaaagtg cccgtgtcag 
gatggggtgc ggttgcacag gtacgctcca 
acattcctgg tcgggctcaa tcaatacctg 
ccggacgtag cagtgctcac ttccatgctc 
gctaagcgta ggctggccag gggatctccc 
ctgtctgcgc cttccttgaa ggcaacatgc 
ctcatcgagg ccaacctcct gtggcggcag 
tcagaaaata aggtagtaat tttggactct 
agggaagtat ccgttccggc ggagatcctg 
cccatatggg cacgcccgga ttacaaccct 
tacgtccctc cagtggtaca cgggtgtcca 
cctccacgga ggaagaggac ggttgtcctg 
gagctcgcca caaagacctt cggcagctcc 
acggcctctc ctgaccagcc ctccgacgac 
tcctccatgc ccccccttga gggggagccg 
tctaccgtaa gcgaggaggc tagtgaggac 
acaggcgccc tgatcacgcc atgcgctgcg 
agcaactctt tgctccgtca ccacaacttg 
ctgcggcaga agaaggtcac ctttgacaga 
gtgctcaagg agatgaaggc gaaggcgtcc 
gaagcctgta agctgacgcc cccacattcg 
gacgtccgga acctatccag caaggccgtt 
ctggaagaca ctgagacacc aattgacacc 
gtccaaccag agaagggggg ccgcaagcca 
gttcgtgtgt gcgagaaaat ggccctttac 
atgggctctt catacggatt ccaatactct 
gcctggaaag cgaagaaatg ccctatgggc 
acggtcactg agaatgacat ccgtgttgag 
cccgaagcca gacaggccat aaggtcgctc 
actaattcta aagggcagaa ctgcggctat 
accagctgcg gtaataccct cacatgttac 
aagctccagg actgcacgat gctcgtatgc 
gcggggaccc aagaggacga ggcgagccta 
■ tctgcccccc ctggggaccc gcccaaacca 
tcctccaatg tgtcagtcgc gcacgatgca 
gaccccacca ccccccttgc gcgggctgcg 
tcctggctag gcaacatcat catgtatgcg 
actcatttct tctccatcct tctagctcag 
atctacgggg cctgttactc cattgagcca 
catggcctta gcgcattttc actccatagt 
tcatgcctca ggaaacttgg ggtaccgccc 
gtccgcgcta ggctactgtc ccaggggggg 
aactgggcag taaggaccaa gctcaaactc 
ttatccagct ggttcgttgc tggttacagc 
gcccgacccc gctggttcat gtggtgccta 
ctactcccca accgatgaac ggggacctaa 
tttccctttt tttttttctt tttttttttt 
tttttcctct ttttttcctt ttctttcctt 
tagctgtgaa aggtccgtga gccgcttgac 



gcaggcttgt ccactctgcc tggcaacccc 4080 
tctatcacca gcccgctcac cacccaacat 4140 
gtggccgccc aacttgctcc tcccagcgct 4200 
ggagcggctg ttggcagcat aggccttggg 4260 
ggagcagggg tggcaggcgc gctcgtggcc 4320 
accgaggacc tggttaacct actccctgct 4380 
gtcgtgtgcg cagcgatact gcgtcggcac 4440 
atgaaccggc tgatagcgtt cgcttcgcgg 4500 
cctgagagcg acgctgcagc acgtgtcact 4560 
ctgctgaaga ggcttcacca gtggatcaac 4620 
tggctaagag atgtttggga ttggatatgc 4680 
cagtccaagc tcctgccgcg attgccggga 4740 
aagggagtct ggcggggcga cggcatcatg 4800 
accggacatg tgaaaaacgg ttccatgagg 4860 
tggcatggaa cattccccat taacgcgtac 4920 
ccaaattatt ctagggcgct gtggcgggtg 4980 
gtgggggatt tccactacgt gacgggcatg 5040 
gttccggccc ccgaattctt cacagaagtg 5100 
gcgtgcaaac ccctcctacg ggaggaggtc 5160 
gttgggtcac agctcccatg cgagcccgaa 5220 
accgacccct cccacattac ggcggagacg 5280 
ccctccttgg ccagctcatc agctagccag 5340 
actacccgtc atgactcccc ggacgctgac 5400 
gagatgggcg ggaacatcac ccgcgtggag 5460 
ttcgagccgc tccaagcgga ggaggatgag 5520 
cggaggtcca ggaaattccc tcgagcgatg 5580 
ccactgttag agtcctggaa ggacccggac 5640 
ttgccgcctg ccaaggcccc tccgatacca 5700 
tcagaatcta ccgtgtcttc tgccttggcg 5760 
gaatcgtcgg ccgtcgacag cggcacggca 5820 
ggcgacgcgg gatccgacgt tgagtcgtac 5880 
ggggatcccg atctcagcga cgggtcttgg 5940 
gtcgtctgct gctcgatgtc ctacacatgg 6000 
gaggaaacca agctgcccat caatgcactg 6060 
gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcaggtcc tggacgacca ctaccgggac 6180 
acagttaagg ctaaacttct atccgtggag 6240 
gccagatcta aatttggcta tggggcaaag 6300 
aaccacatcc gctccgtgtg gaaggacttg 6360 
accatcatgg caaaaaatga ggttttctgc 6420 
gctcgcctta tcgtattccc agatttgggg 6480 
gatgtggtct ccaccctccc tcaggccgtg 6540 
cctggacagc gggtcgagtt cctggtgaat 6600 
ttcgcatatg acacccgctg ttttgactca 6660 
gagtcaatct accaatgttg tgacttggcc 6720 
acagagcggc tttacatcgg gggccccctg 6780 
cgccggtgcc gcgcgagcgg tgtactgacg 6840 
ttgaaggccg ctgcggcctg tcgagctgcg 6900 
ggagacgacc ttgtcgttat ctgtgaaagc 6960 
cgggccttca cggaggctat gactagatac 7020 
gaatacgact tggagttgat aacatcatgc 7080 
tctggcaaaa gggtgtacta tctcacccgt 7140 
tgggagacag ctagacacac tccagtcaat 7200 
cccaccttgt gggcaaggat gatcctgatg 7260 
gaacaacttg aaaaagccct agattgtcag 7320 
cttgacctac ctcagatcat tcaacgactc 7380 
tactctccag gtgagatcaa tagggtggct 7440 
ttgcgagtct ggagacatcg ggccagaagt 7500 
agggctgcca cttgtggcaa gtacctcttc 7560 
actccaatcc cggctgcgtc ccagttggat 7620 
gggggagaca tatatcacag cctgtctcgt 7680 
ctcctacttt ctgtaggggt aggcatctat 7740 
acactccagg ccaataggcc atcctgtttt 7800 
tttttttttt tttttttttt ttctcctttt 7860 
tggtggctcc atcttagccc tagtcacggc 7920 
tgcagagagt gctgatactg gcctctctgc 7980 
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agatcaagt 7989 

<210> 7 
<211> 7848 
<212> DNA 

<213> Hepatitis C virus 
<400> 7 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg .2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc gg'tggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
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ctggtagcat accaggctac ggtgtgcgcc 
caaatgtgga agtgtctcat acggctaaag 
tataggctgg gagccgttca aaacgaggtt 
atggcatgca tgtcggctga cctggaggtc 
gtcctagcag ctctggccgc gtattgcctg 
atcatcttgt ccggaaagcc ggccatcatt 
gatgagatgg aagagtgcgc ctcacacctc 
gaacaattca aacagaaggc aatcgggttg 
gctgctcccg tggtggaatc caagtggcgg 
tggaatttca tcagcgggat acaatattta 
gcgatagcat cactgatggc attcacagcc 
accctcctgt ttaacatcct ggggggatgg 
gcttctgctt tcgtaggcgc cggcatcgct 
aaggtgcttg tggatatttt ggcaggttat 
tttaaggtca tgagcggcga gatgccctcc 
atcctctccc ctggcgccct agtcgtcggg 
gtgggcccag gggagggggc tgtgcagtgg 
ggtaaccacg tctcccccac gcactatgtg 
cagatcctct ctagtcttac catcactcag 
gaggactgct ccacgccatg ctccggctcg 
acggtgttga ctgatttcaa gacctggctc 
gtccccttct tctcatgtca acgtgggtac 
caaaccacct gcccatgtgg agcacagatc 
atcgtggggc ctaggacctg tagtaacacg 
accacgggcc cctgcacgcc ctccccggcg 
gctgctgagg agtacgtgga ggttacgcgg 
accactgaca acgtaaagtg cccgtgtcag 
gatggggtgc ggttgcacag gtacgctcca 
acattcctgg tcgggctcaa tcaatacctg 
ccggacgtag cagtgctcac ttccatgctc 
gctaagcgta ggctggccag gggatctccc 
ctgtactctt tcgagccgct ccaagcggag 
gagatcctgc ggaggtccag gaaattccct 
tacaaccctc cactgttaga gtcctggaag 
gggtgtccat tgccgcctgc caaggcccct 
gttgtcctgt cagaatctac cgtgtcttct 
ggcagctccg aatcgtcggc cgtcgacagc 
tccgacgacg gcgacgcggg atccgacgtt 
ggggagccgg gggatcccga tctcagcgac 
agtgaggacg tcgtctgctg ctcgatgtcc 
tgcgctgcgg aggaaaccaa gctgcccatc 
cacaacttgg tctatgctac aacatctcgc 
tttgacagac tgcaggtcct ggacgaccac 
aaggcgtcca cagttaaggc taaacttcta 
ccacattcgg ccagatctaa atttggctat 
aaggccgtta accacatccg ctccgtgtgg 
attgacacca ccatcatggc aaaaaatgag 
cgcaagccag ctcgccttat cgtattccca 
gccctttacg atgtggtctc caccctccct 
caatactctc ctggacagcg ggtcgagttc 
cctatgggct tcgcatatga cacccgctgt 
cgtgttgagg agtcaatcta ccaatgttgt 
aggtcgctca cagagcggct ttacatcggg 
tgcggctatc gccggtgccg cgcgagcggt 
acatgttact tgaaggccgc tgcggcctgt 
ctcgtatgcg gagacgacct tgtcgttatc 
gcgagcctac gggccttcac ggaggctatg 
cccaaaccag aatacgactt ggagttgata 
cacgatgcat ctggcaaaag ggtgtactat 
cgggctgcgt gggagacagc tagacacact 
atgtatgcgc ccaccttgtg ggcaaggatg 
ctagctcagg aacaacttga aaaagcccta 
attgagccac ttgacctacc tcagatcatt 
ctccatagtt actctccagg tgagatcaat 
gtaccgccct tgcgagtctg gagacatcgg 
caggggggga gggctgccac ttgtggcaag 



agggctcagg ctccacctcc atcgtgggac 3540 
cctacgctgc acgggccaac gcccctgctg 3600 
actaccacac accccataac caaatacatc 3660 
gtcacgagca cctgggtgct ggtaggcgga 3720 
acaacaggca gcgtggtcat tgtgggcagg 3780 
cccgacaggg aagtccttta ccgggagttc 3840 
ccttacatcg aacagggaat gcagctcgcc 3900 
ctgcaaacag ccaccaagca agcggaggct 3960 
accctcgaag ccttctgggc gaagcatatg 4020 
gcaggcttgt ccactctgcc tggcaacccc 4080 
tctatcacca gcccgctcac cacccaacat 4140 
gtggccgccc aacttgctcc tcccagcgct 4200 
ggagcggctg ttggcagcat aggccttggg 4260 
ggagcagggg tggcaggcgc gctcgtggcc 4320 
accgaggacc tggttaacct actccctgct 4380 
gtcgtgtgcg cagcgatact gcgtcggcac 4440 
atgaaccggc tgatagcgtt cgcttcgcgg 4500 
cctgagagcg acgctgcagc acgtgtcact 4560 
ctgctgaaga ggcttcacca gtggatcaac 4620 
tggctaagag atgtttggga ttggatatgc 4680 
cagtccaagc tcctgccgcg attgccggga 4740 
aagggagtct ggcggggcga cggcatcatg 4800 
accggacatg tgaaaaacgg ttccatgagg 4860 
tggcatggaa cattccccat taacgcgtac 4920 
ccaaattatt ctagggcgct gtggcgggtg 4980 
gtgggggatt tccactacgt gacgggcatg 5040 
gttccggccc ccgaattctt cacagaagtg 5100 
gcgtgcaaac ccctcctacg ggaggaggtc 5160 
gttgggtcac agctcccatg cgagcccgaa 5220 
accgacccct cccacattac ggcggagacg 5280 
ccctccttgg ccagctcatc agctagccag 5340 
gaggatgaga gggaagtatc cgttccggcg 5400 
cgagcgatgc ccatatgggc acgcccggat 54 60 
gacccggact acgtccctcc agtggtacac 5520 
ccgataccac ctccacggag gaagaggacg 5580 
gccttggcgg agctcgccac aaagaccttc 5640 
ggcacggcaa cggcctctcc tgaccagccc 5700 
gagtcgtact cctccatgcc cccccttgag 5760 
gggtcttggt ctaccgtaag cgaggaggct 5820 
tacacatgga caggcgccct gatcacgcca 5880 
aatgcactga gcaactcttt gctccgtcac 5940 
agcgcaagcc tgcggcagaa gaaggtcacc 6000 
taccgggacg tgctcaagga gatgaaggcg 6060 
tccgtggagg aagcctgtaa gctgacgccc 6120 
ggggcaaagg acgtccggaa cctatccagc 6180 
aaggacttgc tggaagacac tgagacacca 6240 
gttttctgcg tccaaccaga gaaggggggc 6300 
gatttggggg ttcgtgtgtg cgagaaaatg 6360 
caggccgtga tgggctcttc atacggattc 6420 
ctggtgaatg cctggaaagc gaagaaatgc 6480 
tttgactcaa cggtcactga gaatgacatc 6540 
gacttggccc ccgaagccag acaggccata 6600 
ggccccctga ctaattctaa agggcagaac 6660 
gtactgacga ccagctgcgg taataccctc 6720 
cgagctgcga agctccagga ctgcacgatg 6780 
tgtgaaagcg cggggaccca agaggacgag 6840 
actagatact ctgccccccc tggggacccg 6900 
acatcatgct cctccaatgt gtcagtcgcg 6960 
ctcacccgtg accccaccac cccccttgcg 7020 
ccagtcaatt cctggctagg caacatcatc 7080 
atcctgatga ctcatttctt ctccatcctt 7140 
gattgtcaga tctacggggc ctgttactcc 7200 
caacgactcc atggccttag cgcattttca 7260 
agggtggctt catgcctcag gaaacttggg 7320 
gccagaagtg tccgcgctag gctactgtcc 7380 
tacctcttca actgggcagt aaggaccaag 7 440 
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ctcaaactca ctccaatccc ggctgcgtcc 
ggttacagcg ggggagacat atatcacagc 
tggtgcctac tcctactttc tgtaggggta 
gggacctaaa cactccaggc caataggcca 
tttttttttt tttttttttt tttttttttt 
tctttccttt ggtggctcca tcttagccct 
ccgcttgact gcagagagtg ctgatactgg 



cagttggatt tatccagctg gttcgttgct 7500 
ctgtctcgtg cccgaccccg ctggttcatg 7560 
ggcatctatc tactccccaa ccgatgaacg 7620 
tcctgttttt ttccottttt ttttttcttt 7 680 
tctccttttt ttttcctctt tttttccttt 7740 
agtcacggct agctgtgaaa ggtccgtgag 7800 
cctctctgca gatcaagt 7848 



<210> 8 
<211> 7987 
<212> DNA 

<213> Hepatitis C virus 



<400> 8 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaggg cgcgccatga 
cggccgcttg ggtggagagg ctattcggct 
ctgatgccgc cgtgttccgg ctgtcagcgc 
acctgtccgg tgccctgaat gaactgcagg 
cgacgggcgt tccttgcgca gctgtgctcg 
tgctattggg cgaagtgocg gggcaggatc 
aagtatccat catggctgat gcaatgcggc 
cattcgacca ccaagcgaaa catcgcatcg 
ttgtcgatca ggatgatctg gacgaagagc 
ccaggctcaa ggcgcgcatg cccgacggcg 
gcttgccgaa tatcatggtg gaaaatggcc 
tgggtgtggc ggaccgctat caggacatag 
ttggcggcga atgggctgac cgcttcctcg 
agcgcatcgc cttctatcgc cttcttgacg 
gtttccctct agcgggatca attccgoccc 
cgaagccgct tggaataagg ccggtgtgcg 
ccgtcttttg gcaatgtgag ggcccggaaa 
aggggtcttt cccctctcgc caaaggaatg 
gttcctctgg aagcttcttg aagacaaaca 
aaccccccac ctggcgacag gtgcctctgc 
gcaaaggcgg cacaacccca gtgccacgtt 
tggctctcct caagcgtatt caacaagggg 
atgggatctg atctggggcc tcggtgcaca 
aacgtctagg ccccccgaac cacggggacg 
atggcgccta ttacggccta ctcccaacag 
agcctcacag gccgggacag gaaccaggtc 
acacaatctt tcctggcgac ctgcgtcaat 
ggctcaaaga cccttgccgg cccaaagggc 
caggacctcg tcggctggcg agcgcccccc 
ggcagctcgg acctttactt ggtcacgagg 
ggcgacagca gggggagcct actctccccc 
ggcggtccac tgctctgccc ctcggggcac 
acccgagggg ttgcgaaggc ggtggacttt 
cggtccccgg tcttcacgga caactcgtcc 
gcccatctac acgcccctac tggtagcggc 
gcccaagggt ataaggtgct tgtcctgaac 
gcgtatatgt ctaaggcaca tggtatcgac 
accacgggtg cccccatcac gtactccacc 
tctgggggcg cctatgacat cataatatgt 
atcctgggca tcggcacagt cctggaccaa 
ctcgccaccg ctacgcctcc gggatcggtc 
gctctgtcca gcactggaga aatccccttt 
aaggggggga ggcacctcat tttctgccat 
aagctgtccg gcctcggact caatgctgta 
ataccaacta gcggagacgt cattgtcgta 
ggcgatttcg actcagtgat cgactgcaat 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
ttgaacaaga tggattgcac gcaggttctc 420 
atgactgggc acaacagaca atcggctgct 480 
aggggcgccc ggttcttttt gtcaagaccg 540 
acgaggcagc gcggctatcg tggctggcca 600 
acgttgtcac tgaagcggga agggactggc 660 
tcctgtcatc tcaccttgct cctgccgaga 720 
ggctgcatac gcttgatccg gctacctgcc 780 
agcgagcacg tactcggatg gaagccggtc 840 
atcaggggct cgcgccagcc gaactgttcg 900 
aggatctcgt cgtgacccat ggcgatgcct 960 
gcttttctgg attcatcgac tgtggccggc 1020 
cgttggctac ccgtgatatt gctgaagagc 1080 
tgctttacgg tatcgccgct cccgattcgc 1140 
agttcttctg agtttaaaca gaccacaacg 1200 
tctccctccc ccccccctaa cgttactggc 1260 
tttgtctata tgttattttc caccatattg 1320 
cctggccctg tcttcttgac ga'gcattcct 1380 
caaggtctgt tgaatgtcgt gaaggaagca 1440 
acgtctgtag cgaccctttg caggcagcgg 1500 
ggccaaaagc cacgtgtata agatacacct 1560 
gtgagttgga tagttgtgga aagagtcaaa 1620 
ctgaaggatg cccagaaggt accccattgt 1680 
tgctttacat gtgtttagtc gaggttaaaa 1740 
tggttttcct ttgaaaaaca cgataatacc 1800 
acgcgaggcc tacttggctg catcatcact 1860 
gagggggagg tccaagtggt ctccacegca 1920 
ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ccaatcaccc aaatgtacac caatgtggac 2040 
ggggcgcgtt ccttgacacc atgcacctgc 2100 
catgccgatg tcattccggt gcgccggcgg 2160 
aggcccgtct cctacttgaa gggctcttcg 2220 
gctgtgggca tctttcgggc tgccgtgtgc 2280 
gtacccgtcg agtctatgga aaccactatg 2340 
cctccggccg taccgcagac attccaggtg 2400 
aagagcacta aggtgccggc tgcgtatgca 2460 
ccgtccgtcg ccgccaccct aggtttcggg 2520 
cctaacatca gaaccggggt aaggaccatc 2580 
tatggcaagt ttcttgccga cggtggttgc 2640 
gatgagtgcc actcaactga ctcgaccact 2700 
gcggagacgg ctggagcgcg actcgtcgtg 2760 
accgtgccac atccaaacat cgaggaggtg 2820 
tatggcaaag ccatccccat cgagaccatc 2880 
tccaagaaga aatgtgatga gctcgccgcg 2940 
gcatattacc ggggccttga tgtatccgtc 3000 
gcaacggacg ctctaatgac gggctttacc 3060 
acatgtgtca cccagacagt cgacttcagc 3120 
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ctggacccga ccttcaccat tgagacgacg 
cagcggcgag gcaggactgg taggggcagg 
gaacggccct cgggcatgtt cgattcctcg 
gcttggtacg agctcacgcc cgccgagacc 
ccagggttgc ccgtctgcca ggaccatctg 
acccacatag acgcccattt cttgtcccag 
ctggtagcat accaggctac ggtgtgcgcc 
caaatgtgga agtgtctcat acggctaaag 
tataggctgg gagccgttca aaacgaggtt 
atggcatgca tgtcggctga cctggaggtc 
gtcctagcag ctctggccgc gtattgcctg 
atcatcttgt ccggaaagcc ggccatcatt 
gatgagatgg aagagtgcgc ctcacacctc 
gaacaattca aacagaaggc aatcgggttg 
gctgctcccg tggtggaatc caagtggcgg 
tggaatttca tcagcgggat acaatattta 
gcgatagcat cactgatggc attcacagcc 
accctcctgt ttaacatcct ggggggatgg 
gcttctgctt tcgtaggcgc cggcatcgct 
aaggtgcttg tggatatttt ggcaggttat 
tttaaggtca tgagcggcga gatgccctcc 
atcctctccc ctggcgccct agtcgtcggg 
gtgggcccag gggagggggc tgtgcagtgg 
ggtaaccacg tctcccccac gcactatgtg 
cagatcctct ctagtcttac catcactcag 
gaggactgct ccacgccatg ctccggctcg 
acggtgttga ctgatttcaa gacctggctc 
gtccccttct tctcatgtca acgtgggtac 
caaaccacct gcccatgtgg agcacagatc 
atcgtggggc ctaggacctg tagtaacacg 
accacgggcc cctgcacgcc ctccccggcg 
gctgctgagg agtacgtgga ggttacgcgg 
accactgaca acgtaaagtg cccgtgtcag 
gatggggtgc ggttgcacag gtacgctcca 
acattcctgg tcgggctcaa tcaatacctg 
ccggacgtag cagtgctcac ttccatgctc 
gctaagcgta ggctggccag gggatctccc 
ctgtctgcgc cttccttgaa ggcaacatgc 
c'tcatcgagg ccaacctcct gtggcggcag 
tcagaaaata aggtagtaat tttggactct 
agggaagtat ccgttccggc ggagatcctg 
cccatatggg cacgcccgga ttacaaccct 
tacgtccctc cagtggtaca cgggtgtcca 
cctccacgga ggaagaggac ggttgtcctg 
gagctcgcca caaagacctt cggcagctcc 
acggcctctc ctgaccagcc ctccgacgac 
tcctccatgc ccccccttga gggggagccg 
tctaccgtaa gcgaggaggc tagtgaggac 
acaggcgccc tgatcacgcc atgcgctgcg 
agcaactctt tgctccgtca ccacaacttg 
ctgcggcaga agaaggtcac ctttgacaga 
gtgctcaagg agatgaaggc gaaggcgtcc 
gaagcctgta agctgacgcc cccacattcg 
gacgtccgga acctatccag caaggccgtt 
ctggaagaca ctgagacacc aattgacacc 
gtccaaccag agaagggggg ccgcaagcca 
gttcgtgtgt gcgagaaaat ggccctttac 
atgggctctt catacggatt ccaatactct 
gcctggaaag cgaagaaatg ccctatgggc 
acggtcactg agaatgacat ccgtgttgag 
cccgaagcca gacaggccat aaggtcgctc 
actaattcta aagggcagaa ctgcggctat 
accagctgcg gtaataccct cacatgttac 
aagctccagg actgcacgat gctcgtatgc 
gcggggaccc aagaggacga ggcgagccta 
tctgcccccc ctggggaccc gcccaaacca 



accgtgccac aagacgcggt gtcacgctcg 3180 
atgggcattt acaggtttgt gactccagga 3240 
gttctgtgcg agtgctatga cgcgggctgt 3300 
tcagttaggt tgcgggctta cctaaacaca 3360 
gagttctggg agagcgtctt tacaggcctc 3420 
actaagcagg caggagacaa cttcccctac 3480 
agggctcagg ctccacctcc atcgtgggac 3540 
cctacgctgc acgggccaac gcccctgctg 3600 
actaccacac accccataac caaatacatc 3660 
gtcacgagca cctgggtgct ggtaggcgga 3720 
acaacaggca gcgtggtcat tgtgggcagg 3780 
cccgacaggg aagtccttta ccgggagttc 3840 
ccttacatcg aacagggaat gcagctcgcc 3900 
ctgcaaacag ccaccaagca agcggaggct 3960 
accctcgaag ccttctgggc gaagcatatg 4020 
gcaggcttgt ccactctgcc tggcaacccc 4080 
tctatcacca gcccgctcac cacccaacat 4140 
gtggccgccc aacttgctcc tcccagcgct 4200 
ggagcggctg ttggcagcat aggccttggg 4260 
ggagcagggg tggcaggcgc gctcgtggcc 4320 
accgaggacc tggttaacct actccctgct 4380 
gtcgtgtgcg cagcgatact gcgtcggcac 4440 
atgaaccggc tgatagcgtt cgcttcgcgg 4500 
cctgagagcg acgctgcagc acgtgtcact 4560 
ctgctgaaga ggcttcacca gtggatcaac 4620 
tggctaagag atgtttggga ttggatatgc .4680 
cagtccaagc tcctgccgcg attgccggga 4740 
aagggagtct ggcggggcga cggcatcatg 4800 
accggacatg tgaaaaacgg ttccatgagg 4860 
tggcatggaa cattccccat taacgcgtac 4920 
ccaaattatt ctagggcgct gtggcgggtg 4980 
gtgggggatt tccactacgt gacgggcatg 5040 
gttccggccc ccgaattctt cacagaagtg 5100 
gcgtgcaaac ccctcctacg ggaggaggtc 5160 
gttgggtcac agctcccatg cgagcccgaa 5220 
accgacccct cccacattac ggcggagacg 5280 
ccctccttgg ccagctcatc agctatccag 5340 
actacccgtc atgactcccc ggacgctgac 5400 
gagatgggcg ggaacatcac ccgcgtggag 5460 
ttcgagccgc tccaagcgga ggaggatgag 5520 
cggaggtcca ggaaattccc tcgagcgatg 5580 
ccactgttag agtcctggaa ggacccggac 5640 
ttgccgcctg ccaaggcccc tccgatacca 5700 
tcagaatcta ccgtgtcttc tgccttggcg 5760 
gaatcgtcgg ccgtcgacag cggcacggca 5820 
ggcgacgcgg gatccgacgt tgagtcgtac 5880 
ggggatcccg atctcagcga cgggtcttgg 5940 
gtcgtctgct gctcgatgtc ctacacatgg 6000 
gaggaaacca agctgcccat caatgcactg 6060 
gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcaggtcc tggacgacca ctaccgggac 6180 
acagttaagg ctaaacttct atccgtggag 6240 
gccagatcta aatttggcta tggggcaaag 6300 
aaccacatcc gctccgtgtg gaaggacttg 6360 
accatcatgg caaaaaatga ggttttctgc 6420 
gctcgcctta tcgtattccc agatttgggg 6480 
gatgtggtct ccaccctccc tcaggccgtg 6540 
cctggacagc gggtcgagtt cctggtgaat 6600 
ttcgcatatg acacccgctg ttttgactca 6660 
gagtcaatct accaatgttg tgacttggcc 6720 
acagagcggc tttacatcgg gggccccctg 6780 
cgccggtgcc gcgcgagcgg tgtaotgacg 6840 
ttgaaggccg ctgcggcctg tcgagctgcg 6900 
ggagacgacc ttgtcgttat ctgtgaaagc 6960 
cgggccttca cggaggctat gactagatac 7020 
gaatacgact tggagttgat aacatcatgc 7080 
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tcctccaatg tgtcagtcgc gcacgatgca 

gaccccacca ccccccttgc gcgggctgcg 

tcctggctag gcaacatcat catgtatgcg 

actcatttct tctccatcct tctagctcag 

atctacgggg cctgttactc cattgagcca 

catggcctta gcgcattttc actccatagt 

tcatgcctca ggaaacttgg ggtaccgccc 

gtccgcgcta ggctactgtc ccaggggggg 

aactgggcag taaggaccaa gctcaaactc 

ttatccagct ggttcgttgc tggttacagc 

gcccgacccc gctggttcat gtggtgccta 

ctactcccca accgatgaac ggggagctaa 

tttccctttt tttttttctt tttttttttt 

tttcctcttt ttttcctttt ctttcctttg 
gctgtgaaag gtccgtgagc cgcttgactg 
atcaagt 



tctggcaaaa gggtgtacta tctcacccgt 7140 
tgggagacag ctagacacac tccagtcaat 7200 
cccaccttgt gggcaaggat gatcctgatg 7260 
gaacaaottg aaaaagccct agattgtcag 7320 
cttgacctac ctcagatcat tcaacgactc 7380 
tactctccag gtgagatcaa tagggtggct 7440 
ttgcgagtct ggagacatcg ggccagaagt 7500 
agggctgcca cttgtggcaa gtacctcttc 7560 
actccaatcc cggctgcgtc ccagttggat 7620 
gggggagaca tatatcacag cctgtctcgt 7680 
ctcctacttt ctgtaggggt aggcatctat 7740 
acactccagg ccaataggcc atcctgtttt 7800 
tttttttttt tttttttttt ctcctttttt 7860 
gtggctccat cttagcccta gtcacggcta 7920 
cagagagtgc tgatactggc ctctctgcag 7980 
7987 



<210> 9 
<211> 7989 
<212> DNA 

<213> Hepatitis C virus 



<400> 9 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaggg cgcgccatga 
cggccgcttg ggtggagagg ctattcggct 
ctgatgccgc cgtgttccgg ctgtcagcgc 
acctgtccgg tgccctgaat gaactgcagg 
cgacgggcgt tccttgcgca gctgtgctcg 
tgctattggg cgaagtgccg gggcaggatc 
aagtatccat catggctgat gcaatgcggc 
cattcgacca ccaagcgaaa catcgcatcg 
ttgtcgatca ggatgatctg gacgaagagc 
ccaggctcaa ggcgcgcatg cccgacggcg 
gcttgccgaa tatcatggtg gaaaatggcc 
tgggtgtggc ggaccgctat caggacatag 
ttggcggcga atgggctgac cgcttcctcg 
agcgcatcgc cttctatcgc cttcttgacg 
gtttccctct agcgggatca attccgcccc 
cgaagccgct tggaataagg ccggtgtgcg 
ccgtcttttg gcaatgtgag ggcccggaaa 
aggggtcttt cccctctcgc caaaggaatg 
gttcctctgg aagcttcttg aagacaaaca 
aaccccccac ctggcgacag gtgcctctgc 
gcaaaggcgg cacaacccca gtgccacgtt 
tggctctcct caagcgtatt caacaagggg 
atgggatctg atctggggcc tcggtgcaca 
aacgtctagg ccccccgaac cacggggacg 
atggcgccta ttacggccta ctcccaacag 
agcctcacag gccgggacag gaaccaggtc 
acacaatctt tcctggcgac ctgcgtcaat 
ggctcaaaga cccttgccgg cccaaagggc 
caggacctcg tcggctggca agcgcccccc 
ggcagctcgg acctttactt ggtcacgagg 
ggcgacagca gggggagcct actctccccc 
ggcggtccac tgctctgccc ctcggggcac 
acccgagggg ttgcgaaggc ggtggacttt 
cggtccccgg tcttcacgga caactcgtcc 
gcccatctac acgcccctac tggtagcggc 
gcccaagggt ataaggtgct tgtcctgaac 
gcgtatatgt ctaaggcaca tggtatcgac 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
ttgaacaaga tggattgcac gcaggttctc 420 
atgactgggc acaacagaca atcggctgct 480 
aggggcgccc ggttcttttt gtcaagaccg 540 
acgaggcagc gcggctatcg tggctggcca 600 
acgttgtcac tgaagcggga agggactggc 660 
tcctgtcatc tcaccttgct cctgccgaga 720 
ggctgcatac gcttgatccg gctacctgcc 780 
agcgagcacg tactcggatg gaagccggtc 840 
atcaggggct cgcgccagcc gaactgttcg 900 
aggatctcgt cgtgacccat ggcgatgcct 960 
gcttttctgg attcatcgac tgtggccggc 1020 
cgttggctac ccgtgatatt gctgaagagc 1080 
tgctttacgg tatcgccgct cccgattcgc 1140 
agttcttctg agtttaaaca gaccacaacg 1200 
tctccctccc ccccccctaa cgttactggc 1260 
tttgtctata tgttattttc caccatattg 1320 
cctggccctg tcttcttgac gagcattcct 1380 
caaggtctgt tgaatgtcgt gaaggaagca 1440 
acgtctgtag cgaccctttg caggcagcgg 1500 
ggccaaaagc cacgtgtata agatacacct 1560 
gtgagttgga tagttgtgga aagagtcaaa 1620 
ctgaaggatg cccagaaggt accccattgt 1680 
tgctttacat gtgtttagtc gaggttaaaa 1740 
tggttttcct ttgaaaaaca cgataatacc 1800 
acgcgaggcc tacttggctg catcatcact 1860 
gagggggagg tccaagtggt ctccaccgca 1920 
ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ccaatcaccc aaatgtacac caatgtggac 2040 
ggggcgcgtt ccttgacacc atgcacctgc 2100 
catgccgatg tcattccggt gcgccggcgg 2160 
aggcccgtct cctacttgaa gggctcttcg 2220 
gctgtgggca tctttcgggc tgccgtgtgc 2280 
gtacccgtcg agtctatgga aaccactatg 2340 
cctccggccg taccgcagac attccaggtg 2400 
a-agagcacta aggtgccggc tgcgtatgca 2460 
ccgtccgtcg ccgccaccct aggtttcggg 2520 
cctaacatca gaaccggggt aaggaccatc 2580 
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accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtggg agtgtctcat acggctaaag cctacgctgc acgggccaac , gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctggtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgtg ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg. 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 57 60 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
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atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggcog ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggacctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tt.tttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 
tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 
tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7 980 
agatcaagt 7989 

<210> 10 
<211> 7989 
<212> DNA 

<213> Hepatitis C virus 
<400> 10 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccotg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
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caggacctcg tcggctggca agcgcccccc 
ggcagctcgg acctttactt ggtcacgagg 
ggcgacagca gggggagcct actctccccc 
ggcggtccac tgctctgccc ctcggggcac 
acccgagggg ttgcgaaggc ggtggacttt 
cggtccccgg tcttcacgga caactcgtcc 
gcccatctac acgcccctac tggtagcggc 
gcccaagggt ataaggtgct tgtcctgaac 
gcgtatatgt ctaaggcaca tggtatcgac 
accacgggtg cccccatcac gtactccacc 
tctgggggcg cctatgacat cataatatgt 
atcctgggca tcggcacagt cctggaccaa 
ctcgccaccg ctacgcctcc gggatcggtc 
gctctgtcca gcactggaga aatccccttt 
aaggggggga ggcacctcat tttctgccat 
aagctgtccg gcctcggact caatgctgta 
at'accaacta gcggagacgt cattgtcgta 
ggcgatttcg actcagtgat cgactgcaat 
ctggacccga ccttcaccat tgagacgacg 
cagcggcgag gcaggactgg taggggcagg 
gaacggccct cgggcatgtt cgattcctcg 
gcttggtacg agctcacgcc cgccgagacc 
ccagggttgc ccgtctgcca ggaccatctg 
acccacatag acgcccattt cttgtcccag 
ctggtagcat accaggctac ggtgtgcgcc 
caaatgtgga agtgtctcat acggctaaag 
tataggctgg gagccgttca aaacgaggtt 
atggcatgca tgtcggctga cctggaggtc 
gtcctagcag ctctggccgc gtattgcctg 
atcatcttgt ccggaaagcc ggccatcatt 
gatgagatgg aagagtgcgc ctcacacctc 
gaacaattca aacagaaggc aatcgggttg 
gctgctcccg tggtggaatc caagtggcgg 
tggaatttca tcagcgggat acaatattta 
gcgatagcat cactgatggc attcacagcc 
accctcctgt ttaacatcct ggggggatgg 
gcttctgctt tcgtaggcgc cggcatcgct 
aaggtgcttg tggatatttt ggcaggttat 
tttaaggtca tgagcggcga gatgccctcc 
atcctctccc ctggcgccct agtcgtcggg 
gtgggcccag gggagggggc tgtgcagtgg 
ggtaaccacg tctcccccac gcactatgtg 
cagatcctct ctagtcttac catcactcag 
gaggactgct ccacgccatg ctccggctcg 
acggtgttga ctgatttcaa gacctggctc 
gtccccttct tctcatgtca acgtgggtac 
caaaccacct gcccatgtgg agcacagatc 
atcgtggggc ctaggacctg tagtaacacg 
accacgggcc cctgcacgcc ctccccggcg 
gctgctgagg agtacgtgga ggttacgcgg 
accactgaca acgtaaagtg cccgtgtcag 
gatggggtgc ggttgcacag gtacgct'cca 
acattcctgg tcgggctcaa tcaatacctg 
ccggacgtag cagtgctcac ttccatgctc 
gctaagcgta ggctggccag gggatctccc 
ctgtctgcgc cttccttgaa ggcaacatgc 
ctcatcgagg ccaacctcct gtggcggcag 
tcagaaaata aggtagtaat tttggactct 
agggaagtat ccgttccggc ggagatcctg 
cccatatggg cacgcccgga ttacaaccct 
tacgtccctc cagtggtaca cgggtgtcca 
cctccacgga ggaagaggac ggttgtcctg 
gagctcgcca caaagacctt cggcagctcc 
acggcctctc ctgaccagcc ctccgacgac 
tcctccatgc ccccccttga gggggagccg 
tctaccgtaa gcgaggaggc tagtgaggac 



ggggcgcgtt ccttgacacc atgcacctgc 2100 
catgccgatg tcattccggt gcgccggcgg 2160 
aggcccgtct cctacttgaa gggctcttcg 2220 
gctgtgggca tctttcgggc tgccgtgtgc 2280 
gtacccgtcg agtctatgga aaccactatg 2340 
cctccggccg taccgcagac attccaggtg 2400 
aagagcacta aggtgccggc tgcgtatgca 2460 
ccgtccgtcg ccgccaccct aggtttcggg 2520 
cctaacatca gaaccggggt aaggaccatc 2580 
tatggcaagt ttcttgccga cggtggttgc 2640 
gatgagtgcc actcaactga ctcgaccact 2700 
gcggagacgg ctggagcgcg actcgtcgtg 2760 
accgtgccac atccaaacat cgaggaggtg 2820 
tatggcaaag ccatccccat cgagaccatc 2880 
tccaagaaga aatgtgatga gctcgccgcg 2940 
gcatattacc ggggccttga tgtatccgtc 3000 
gcaacggacg ctctaatgac gggctttacc 3060 
acatgtgtca cccagacagt cgacttcagc 3120 
accgtgccac aagacgcggt gtcacgctcg 3180 
atgggcattt acaggtttgt gactccagga 3240 
gttctgtgcg agtgctatga cgcgggctgt 3300 
tcagttaggt tgcgggctta cctaaacaca 3360 
gagttctggg agagcgtctt tacaggcctc 3420 
actaagcagg caggagacaa cttcccctac 3480 
agggctcagg ctccacctcc atcgtgggac 3540 
cctacgctgc acgggccaac gcccctgctg 3600 
actaccacac accccataac caaatacatc 3660 
gtcacgagca cctgggtgct ggtaggcgga 3720 
acaacaggca gcgtggtcat tgtgggcagg 3780 
cccgacaggg aagtccttta ccgggagttc 3840 
ccttacatcg aacagggaat gcagctcgcc 3900 
ctgcaaacag ccaccaagca agcggaggct 3960 
accctcgaag ccttctgggc gaagcatatg 4020 
gcaggcttgt ccactctgcc tggcaacccc 4080 
tctatcacca gcccgctcac cacccaacat 4140 
gtggccgccc aacttgctcc tcccagcgct 4200 
ggagcggctg ttggcagcat aggccttggg 4260 
ggagcagggg tggcaggcgc gctcgtggcc 4320 
accgaggacc tggttaacct actccctgct 4380 
gtcgtgtgcg cagcgatact gcgtcggcac 4440 
atgaaccggc tgatagcgtt cgcttcgcgg 4500 
cctgagagcg acgctgcagc acgtgtcact 4560 
ctgctgaaga ggcttcacca gtggatcaac 4620 
tggctaagag atgtttggga ttggatatgc 4680 
cagtccaagc tcctgccgcg attgccggga 4740 
aagggagtct ggcggggcga cggcatcatg 4800 
accggacatg tgaaaaacgg ttccatgagg 4860 
tggcatggaa cattccccat taacgcgtac 4920 
ccaaattatt ctagggcgct gtggcgggtg 4980 
gtgggggatt tccactacgt gacgggcatg 5040 
gttccggccc ccgaattctt cacagaagtg 5100 
gcgtgcaaac ccctcctacg ggaggaggtc 5160 
gttgggtcac agctcccatg cgagcccgaa 5220 
accgacccct cccacattac ggcggagacg 5280 
ccctccttgt ccagctcatc agctagccag 5340 
actacccgtc atgactcccc ggacgctgac 5400 
gagatgggcg ggaacatcac ccgcgtggag 5460 
ttcgagccgc tccaagcgga ggaggatgag 5520 
cggaggtcca ggaaattccc tcgagcgatg 5580 
ccactgttag agtcctggaa ggacccggac 5640 
ttgccgcctg ccaaggcccc tccgatacca 5700 
tcagaatcta ccgtgtcttc tgccttggcg 5760 
gaatcgtcgg ccgtcgacag cggcacggca 5820 
ggcgacgcgg gatccgacgt tgagtcgtac 5880 
ggggatcccg atctcagcga cgggtcttgg 5940 
gtcgtctgct gctcgatgtc ctacacatgg 6000 
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acaggcgccc tgatcacgcc atgcgctgcg 
agcaactctt tgctccgtca ccacaacttg 
ctgcggcaga agaaggtcac ctttgacaga 
gtgctcaagg agatgaaggc gaaggcgtcc 
gaagcctgta agctgacgcc cccacattcg 
gacgtccgga acctatccag caaggccgtt 
ctggaagaca ctgagacacc aattgacacc 
gtccaaccag agaagggggg ccgcaagcca 
gttcgtgtgt gcgagaaaat ggccctttac 
atgggctctt catacggatt ccaatactct 
gcctggaaag cgaagaaatg ccctatgggc 
acggtcactg agaatgacat ccgtgttgag 
cccgaagcca gacaggccat aaggtcgctc 
actaattcta aagggcagaa ctgcggctat 
accagctgcg gtaataccct cacatgttac 
aagctccagg actgcacgat gctcgtatgc 
gcggggaccc aagaggacga ggcgagccta 
tctgcccccc ctggggaccc gcccaaacca 
tcctccaatg tgtcagtcgc gcacgatgca 
gaocccacca ccccccttgc gcgggctgcg 
tcctggctag gcaacatcat catgtatgcg 
actcatttct tctccatcct tctagctcag 
atctacgggg cctgttactc cattgagcca 
catggcctta gcgcattttc actccatagt 
tcatgcctca ggaaacttgg ggtaccgccc 
gtccgcgcta ggctactgtc ccaggggggg 
aactgggcag taaggaccaa gctcaaactc 
ttatccagct ggttcgttgc tggttacagc 
gcccgacccc gctggttcat gtggtgccta 
ctactcccca accgatgaac ggggacctaa 
tttccctttt tttttttctt tttttttttt 
tttttcctct ttttttcctt ttctttcctt 
tagctgtgaa aggtccgtga gccgcttgac 
agatcaajgt 

<210> 11 
<211> 1341 
<212> DNA 

<213> Hepatitis C virus 
<400> 11 

tccggctcgt ggctaagaga tgtttgggat 
acctggctcc agtccaagct cctgccgcga 
cgtgggtaca agggagtctg gcggggcgac 
gcacagatca ccggacatgt gaaaaacggt 
agtaacacgt ggcatggaac attccccatt 
tccccggcgc caaattattc tagggcgctg 
gttacgcggg tgggggattt ccactacgtg 
ccgtgtcagg ttccggcccc cgaattcttc 
tacgctccag cgtgcaaacc cctcctacgg 
caatacctgg ttgggtcaca gctcccatgc 
tccatgctca ccgacccctc ccacattacg 
ggatctcccc cctgcttggc cagctcatca 
gcaacatgca ctacccgtca tgactccccg 
tggcggcagg agatgggcgg gaacatcacc 
ttggactctt tcgagccgct ccaagcggag 
gagatcctgc ggaggtccag gaaattccct 
tacaaccctc cactgttaga gtcctggaag 
gggtgtccat tgccgcctgc caaggcccct 
gttgtcctgt cagaatctac cgtgtcttct 
ggcagctccg aatcgtcggc cgtcgacagc 
tccgacgacg gcgacgcggg atccgacgtt 
ggggagccgg gggatcccga tctcagcgac 
agtgaggacg tcgtctgctg c 

<210> 12 



21 



gaggaaacca agctgcccat caatgcactg 6060 
gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcaggtcc tggacgacca ctaccgggac 6180 
acagttaagg ctaaacttct atccgtggag 6240 
gccagatcta aatttggcta tggggcaaag 6300 
aaccacatcc gctccgtgtg gaaggacttg 6360 
accatcatgg caaaaaatga ggttttctgc 6420 
gctcgcctta tcgtattccc agatttgggg 6480 
gatgtggtct ccaccctccc tcaggccgtg 6540 
cctggacagc gggtcgagtt cctggtgaat 6600 
ttcgcatatg acacccgctg ttttgactca 6660 
gagtcaatct accaatgttg tgacttggcc 6720 
acagagcggc tttacatcgg gggccccctg 6780 
cgccggtgcc gcgcgagcgg tgtactgacg 6840 
ttgaaggccg ctgcggcctg tcgagctgcg 6900 
ggagacgacc ttgtcgttat ctgtgaaagc 6960 
cgggccttca cggaggctat gactagatac 7020 
gaatacgact tggagttgat aacatcatgc 7080 
tctggcaaaa gggtgtacta tctcacccgt 7140 
tgggagacag ctagacacac tccagtcaat 7200 
cccaccttgt gggcaaggat gatcctgatg 7260 
gaacaacttg aaaaagccct agattgtcag 7320 
cttgacctac ctcagatcat tcaacgactc 7380 
tactctccag gtgagatcaa tagggtggct 7440 
ttgcgagtct ggagacatcg ggccagaagt 7500 
agggctgcca cttgtggcaa gtacctcttc 7560 
actccaatcc cggctgcgtc ccagttggat 7620 
gggggagaca tatatcacag cctgtctcgt 7 680 
ctcctacttt ctgtaggggt aggcatctat 7740 
acactccagg ccaataggcc atcctgtttt 7800 
tttttttttt tttttttttt ttctcctttt 7860 
tggtggctcc atcttagccc tagtcacggc 7920 
tgcagagagt gctgatactg gcctctctgc 7980 
7989 



tggatatgca cggtgttgac tgatttcaag 60 
ttgccgggag tccccttctt ctcatgtcaa 120 
ggcatcatgc aaaccacctg cccatgtgga 180 
tccatgagga tcgtggggcc taggacctgt 240 
aacgcgtaca ccacgggccc ctgcacgccc 300 
tggcgggtgg ctgctgagga gtacgtggag 360 
acgggcatga ccactgacaa cgtaaagtgc 420 
acagaagtgg atggggtgcg gttgcacagg 480 
gaggaggtca cattcctggt cgggctcaat 540 
gagcccgaac cggacgtagc agtgctcact 600 
gcggagacgg ctaagcgtag gctggccagg 660 
gctagccagc tgtctgcgcc ttccttgaag 720 
gacgctgacc tcatcgaggc caacctcctg 780 
cgcgtggagt cagaaaataa ggtagtaatt 840 
gaggatgaga gggaagtatc cgttccggcg 900 
cgagcgatgc ccatatgggc acgcccggat 960 
gacccggact acgtccctcc agtggtacac 1020 
ccgataccac ctccacggag gaagaggacg 1080 
gccttggcgg agctcgccac aaagaccttc 1140 
ggcacggcaa cggcctctcc tgaccagccc 1200 
gagtcgtact cctccatgcc cccccttgag 1260 
gggtcttggt ctaccgtaag cgaggaggct 1320 
1341 
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<211> 1341 
<212> DNA 

<213> Hepatitis C virus 
<400> 12 

tccggctcgt ggctaagaga tgtttgggat tggatatgca cggtgttgac tgatttcaag 60 
acctggctcc agtccaagct cctgccgcga ttgccgggag tccccttctt ctcatgtcaa 120 
cgtgggtaca agggagtctg gcggggcgac ggcatcatgc aaaccacctg cccatgtgga 180 
gcacagatca ccggacatgt gaaaaacggt tccatgagga tcgtggggcc taggacctgt 240 
agtaacacgt ggcatggaac attccccatt aacgcgtaca ccacgggccc ctgcacgccc 300 
tccccggcgc caaattattc tagggcgctg tggcgggtgg ctgctgagga gtacgtggag 360 
gttacgcggg tgggggattt ccactacgtg acgggcatga ccactgacaa cgtaaagtgc 420 
ccgtgtcagg ttccggcccc cgaattcttc acagaagtgg atggggtgcg gttgcacagg 480 
tacgctccag cgtgcaaacc cctcctacgg gaggaggtca cattcctggt cgggctcaat 540 
caatacctgg ttgggtcaca gctcccatgc gagcccgaac cggacgtagc agtgctcact 600 
tccatgctca ccgacccctc ccacattacg gcggagacgg ctaagcgtag gctggccagg 660 
ggatctcccc cccccttggc cagctcatca gctagccagc tgtctgcgcc ttccttgaag 720 
gcaacatgca ctacccgtca tgactccccg gacgctgacc tcatcgaggc caacctcctg 780 
tggcggcagg agatgggcgg gaacatcacc cgcgtggagt cagaaaataa ggtagtaatt 840 
ttggactctt tcgagccgct ccaagcggag gaggatgaga gggaagtatc cgttccggcg 900 
gagatcctgc ggaggtccag gaaattccct cgagcgatgc ccatatgggc acgcccggat 960 
tacaaccctc cactgttaga gtcctggaag gacccggact acgtccctcc agtggtacac 1020 
gggtgtccat tgccgcctgc caaggcccct ccgataccac ctccacggag gaagaggacg 1080 
gttgtcctgt cagaatctac cgtgtcttct gccttggcgg agctcgccac aaagaccttc 1140 
ggcagctccg aatcgtcggc cgtcgacagc ggcacggcaa cggcctctcc tgaccagccc 1200 
tccgacgacg gcgacgcggg atccgacgtt gagtcgtact cctccatgcc cccccttgag 1260 
ggggagccgg gggatcccga tctcagcgac gggtcttggt ctaccgtaag cgaggaggct 1320 
agtgaggacg tcgtctgctg c 1341 

<210> 13 
<211> 7987 
<212> DNA 

<213> Hepatitis C virus 
<400> 13 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cocgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
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agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccocat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag. ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctatccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
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acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggcegtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actcca'atcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7 680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ctcctttttt 7860 
tttcctcttt ttttcctttt ctttcctttg gtggctccat cttagcccta gtcacggcta 7920 
gctgtgaaag gtccgtgagc cgcttgactg cagagagtgc tgatactggc ctctctgcag 7980 
atcaagt 7987 

<210> 14 
<211> 400 
<212> PRT 

<213> Hepatitis C virus 
<400> 14 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp lie Cys Thr Val Leu 
1 5 10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin. Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 " 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr. Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 
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Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 



Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Tyr Ser Phe Glu Pro Leu 
225 230 235 240 

Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu lie Leu 
245 250 255 

Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro lie Trp Ala Arg Pro 
260 265 270 

Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val 
275 280 285 

Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro 
290 295 300 



Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser 
325 330 335 

Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin 
340 345 350 

Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser 
355 360 365 

Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly 
370 375 380 

Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
385 390 395 400 



<210> 15 
<211> 1985 
<212> PRT 

<213> Hepatitis C virus 
<400> 15 

Met Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
15 10 15 
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Cys lie lie Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 
35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 
50 55 60 

Leu Ala Gly Pro Lys Gly Pro lie Thr Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Arg Ala Pro Pro Gly Ala Arg Ser Leu Thr 
85 90 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 

Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
130 ' 135 140 

Leu Cys Pro Ser Gly His Ala Val Gly lie Phe Arg Ala Ala Val Cys 
145 150 155 160 

Thr Arg' Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 
165 170 175 

Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro 
180 185 190 

Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr Gly 
195 200 205 

Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr 
210 215 220 

Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
225 230 235 240 

Ala Tyr Met Ser Lys Ala His Gly lie Asp Pro Asn He Arg Thr Gly 
245 250 255 

Val Arg Thr He Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr Gly 
260 265 270 

Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He 
275 280 285 

He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly He 
290 295 300 

Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val 
305 310 315 320 

Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 
325 330 335 

He Glu Glu Val Ala Leu Ser Ser Thr Gly Glu He Pro Phe Tyr Gly 
340 345 " 350 

Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He Phe 
355 360 365 
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Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser Gly 
370 375 380 

Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val 
385 ~ 390 395 400 

lie Pro Thr Ser Gly Asp Val lie Val Val Ala Thr Asp Ala Leu Met 
405 410 415 

Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr Cys 
420 425 430 

Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu 
435 440 445 

Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
450 455 460 

Arg Thr Gly Arg Gly Arg Met Gly He Tyr Arg Phe Val Thr Pro Gly 
465 470 475 - 480 

Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr 
485 490 495 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser Val 
500 505 510 



His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He Asp 
530 535 540 



Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
545 550 555 560 



Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
565 570 575 



Glu Val Thr Thr Thr His Pro He Thr Lys Tyr He Met Ala Cys Met 
610 615 620 



Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly 
625 630 635 640 



Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val 
645 650 655 



He Val Gly Arg He He Leu Ser Gly Lys Pro Ala He He Pro Asp 
660 665 • 670 



Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala Ser 
675 680 685 



Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala 
705 710 715 720 
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Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe Trp 
725 730 735 

Ala Lys His Met Trp Asn Phe lie Ser Gly lie Gin Tyr Leu Ala Gly 
740 745 750 

Leu Ser Thr Leu Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala Phe 
755 760 765 

Thr Ala Ser lie Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu Phe 
770 775 780 

Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala 
785 790 795 800 

Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser 
805 810 815 

He Gly Leu Gly Lys Val Leu Val Asp lie Leu Ala Gly Tyr Gly Ala 
820 825 830 

Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met 
835 840 845 

Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro 
850 855 860 

Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 
865 870 875 880 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
885 • 890 895 

Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu 
900 905 910 

Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr He 
915 920 925 

Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser 
930 935 940 

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys 
945 950 955 960 

Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro 
965 970 975 

Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly 
980 985 990 

Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala 
995 1000 1005 

Gin He Thr Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro 
1010 1015 1020 

Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr 
1025 1030 1035 1040 

Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala 
1045 1050 1055 

Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly 
1060 1065 1070 
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Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro 
1075 1080 1085 

Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg 
1090 1095 1100 

Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val 
1105 1110 1115 1120 

Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro 
1125 1130 1135 

Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp 
1140 1145 1150 

Pro Ser His lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly 
1155 1160 1165 

Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala lie Gin Leu Ser Ala Pro 
1170 1175 1180 

Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp 
1185 1190 1195 1200 

Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn lie 
1205 1210 1215 

Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu 
1220 1225 1230 

Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu 
1235 1240 1245 

He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala 
1250 1255 1260 

Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp 
1265 1270 1275 1280 

Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala 
1285 1290 1295 

Pro Pro He Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu 
1300 1305 1310 

Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly 
1315 1320 1325 

Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro 
1330 1335 1340 

Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr 
1345 1350 1355 1360 

Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser 
1365 1370 1375 

Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val 
1380 1385 1390 

Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys 
1395 1400 1405 

Ala Ala Glu Glu Thr Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu 
1410 1415 1420 
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Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser 
1425 1430 1435 1440 

Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp 
1445 1450 1455 

His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val 
1460 1465 1470 

Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro 
1475 1480 1485 

His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn 
1490 1495 1500 

Leu Ser Ser Lys Ala Val Asn His lie Arg Ser Val Trp Lys Asp Leu 
1505 1510 1515 1520 

Leu Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn 
1525 1530 • 1535 

Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg 
1540 1545 1550 



Leu lie Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala 
1555 1560 1565 



Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser 
1570 1575 1580 

Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn 
1585 1590 1595 1600 

Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg 
1605 1610 1615 

Cys Phe Asp Ser Thr Val Thr Glu Asn Asp lie Arg- Val Glu Glu Ser 
1620 1625 1630 

lie Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala lie Arg 
1635 1640 1645 

Ser Leu Thr Glu Arg Leu Tyr lie Gly Gly Pro Leu Thr Asn Ser Lys 
1650 1655 1660 

Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 
1665 " 1670 1675 1680 

Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala 
1685 1690 1695 

Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp 
1700 1705 1710 

Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu Ala 
1715 1720 1725 

Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro 
1730 1735 1740 

Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys 
1745 1750 1755 1760 

Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr 
1765 1770 1775 
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Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu 
1780 1785 1790 

Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met 
1795 1800 1805 

Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe 
1810 1815 1820 

Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin 
1825 1830 1835 1840 

He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He 
1845 1850 1855 

He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser 
1860 1865 1870 

Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val 
1875 1880 1885 

Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg 
1890 1895 1900 

Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe 
1905 1910 1915 ' " 1920 

Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala 
1925 1930 1935 

Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly Gly 
1940 1945 1950 

Asp He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp 
1955 1960 1965 

Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn 
1970 1975 1980 



<210> 16 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 16 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
15 10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 



Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 80 
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Ser Asn Thr Trp His Gly Thr Phe Pro lie Asn Ala Tyr Thr Thr Gly 



Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 



lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala lie Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu lie Glu 
245 250 255 



Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp- Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 
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Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 17 
<211> 1985 
<212> PRT 

<213> Hepatitis C virus 
<400> 17 

Met Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
1 5 10 15 

Cys He He Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 
35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 
50 55 60 

Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu Thr 
85 90 ' 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 

Asp Val He Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
130 135 . 140 

Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
145 150 155 ' 160 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 
165 170 175 

Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro 
180 185 190 

Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr Gly 
195 200 205 

Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr 
210 215 220 

Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
225 230 235 240 

Ala Tyr Met Ser Lys Ala His Gly He Asp Pro Asn He Arg Thr Gly 
245 250 255 

Val Arg Thr He Thr Thr Gly Ala Pro lie Thr Tyr Ser Thr Tyr Gly 
260 265 ~ 270 

Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He 
275 280 285 

He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly He 
290 295 300 
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Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val 
305 310 315 320 

Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 
325 330 335 

He Glu Glu Val Ala Leu Ser Ser Thr Gly Glu He Pro Phe Tyr Gly 
340 345 " 350 

Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He Phe 
355 360 365 

Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser Gly 
370 375 380 

Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val 
385 390 395 400 

He Pro Thr Ser Gly Asp Val He Val Val Ala Thr Asp Ala Leu Met 
405 410 ~ 415 

Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr Cys 
420 425 430 

Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu 
435 440 445 

Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
450 455 460 

Arg Thr Gly Arg Gly Arg Met Gly He Tyr Arg Phe Val Thr Pro Gly 
465 47(5' 475 480 

Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr 
485 490 495 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser Val 
500 505 510 

Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
515 520 525 

His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He Asp 
530 535 540 

Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
545 550 555 560 

Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
565 570 575 

Pro Ser Trp Asp Gin Met Trp Glu Cys Leu He Arg Leu Lys Pro Thr 
580 585 590 

Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn 
595 600 605 

Glu Val Thr Thr Thr His Pro He Thr Lys Tyr He Met Ala Cys Met 
610 615 620 

Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly 
625 630 635 640 

Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val 
645 650 655 
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lie Val Gly Arg lie He Leu Ser Gly Lys Pro Ala He He Pro Asp 
660 665 670 

Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala Ser 
675 680 685 

His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys 
690 695 700 

Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala 
705 710 715 720 

Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe Trp 
725 730 735 

Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
740 745 750 

Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala Phe 
755 760 765 

Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu Phe 
770 775 780 

Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala 
785 790 795 800 

Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser 
805 810 815 

He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
820 825 830 

Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met 
835 840 845 



Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 
865 870 875 880 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
885 890 895 

Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu 
900 905 910 

Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Gly Leu Thr He 
915 920 925 

Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser 
930 935 940 

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys 
945 950 955 960 

Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro 
965 970 975 

Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly 
980 985 990 

Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala 
995 1000 1005 
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Gin He Thr Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro 
1010 1015 1020 

Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr 
1025 1030 1035 1040 

Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala 
1045 1050 1055 

Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly 
1060 1065 1070 

Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro 
1075 1080 1085 

Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg 
1090 1095 1100 

Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val 
1105 " 1110 - 1115 " 1120 

Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro 
1125 1130 1135 

Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp 
1140 1145 1150 

Pro Ser His He Thr Ala Glu Thr Ala Lys Arg Gly Leu Ala Arg Gly 
1155 1160 1165 



Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 
1170 1175 1180 

Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp 
1185 1190 1195 1200 

Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He 
1205 1210 1215 

Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu 
1220 1225 1230 

Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu 
1235 1240 1245 

He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala 
1250 1255 1260 

Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp 
1265 1270 1275 1280 

Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala 
1285 1290 1295 

Pro Pro He Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu 
1300 1305 1310 

Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly 
1315 1320 1325 

Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro 
1330 1335 1340 

Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr 
1345 1350 1355 1360 
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Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser 
1365 1370 1375 

Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val 
1380 1385 1390 

Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys 
1395 1400 " 1405 

Ala Ala Glu Glu Thr Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu 
1410 1415 1420 

Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser 
1425 1430 1435 1440 

Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp 
1445 1450 1455 

His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val 
1460 1465 1470 



His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn 
1490 1495 1500 

Leu Ser Ser Lys Ala Val Asn His He Arg Ser Val Trp Lys Asp Leu 
1505 1510 1515 1520 

Leu Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys Asn 
1525 1530 1535 

Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg 
1540 1545 1550 

Leu He Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala 
1555 1560 1565 

Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser 
1570 * 1575 1580 

Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn 
1585 1590 1595 1600 

Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg 
1605 1610 1615 

Cys Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser 
1620 1625 1630 

He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg 
1635 1640 1645 

Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys 
1650 1655 1660 

Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 
1665 1670 1675 1680 

Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala 
1685 . 1690 1695 
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Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu Ala 
1715 1720 1725 

Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro 
1730 1735 1740 

Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys 
1745 1750 1755 1760 

Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr 
1765 1770 1775 

Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu 
1780 1785 1790 

Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met 
1795 1800 1805 

Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe 
1810 1815 1820 

Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin 
1825 1830 1835 1840 

He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He 
1845 1850 1855 

He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser 
1860 1865 1870 

Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val 
1875 1880 1885 

Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg 
1890 1895 1900 

Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe 
1905 1910 1915 1920 

Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala 
1925 1930 1935 



Asp He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp 
1955 1960 1965 

Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn 
1970 1975 1980 



<210> 18 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 18 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
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Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly lie Met Gin Thr Thr Cys Pro Cys Gly Ala Gin lie Thr 
50 ' 55 60 

Gly His Val Lys Asn Gly Ser Met Arg lie Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro lie Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Gly Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu lie Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn lie 'Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu lie Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro lie Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro lie 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 
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Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 * 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 19 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 19 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp lie Cys Thr Val Leu 



' Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly lie Met Gin Thr Thr Cys Pro Cys Gly Ala Gin lie Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ser Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 
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Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu lie Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 " 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 



<210> 20 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 20 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
15 10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 



435 



440 



445 



85 



90 



95 
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Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 " 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 " 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Cys Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu lie Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 
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<210> 21 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 21 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp lie Cys Thr Val Leu 



Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 

20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 

35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 

50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 



Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 



He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Pro Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu He Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 
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Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro lie 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
3-70 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 ' 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 22 
<211> 7789 
<212> DNA 

<213> Hepatitis C virus 
<400> 22 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctagcgggat caattccgcc cctctccctc 
cttggaataa ggccggtgtg cgtttgtcta 
tggcaatgtg agggcccgga aacctggccc 
ttcccctctc gccaaaggaa tgcaaggtct 
ggaagcttct tgaagacaaa caacgtctgt 
acctggcgac aggtgcctct gcggccaaaa 
ggcacaaccc cagtgccacg ttgtgagttg 
ctcaagcgta ttcaacaagg ggctgaagga 
tgatctgggg cctcggtgca catgctttac 
ggccccccga accacgggga cgtggttttc 
ggagatggca gcatcgtgcg gaggcgcggt 
accgcactat aagctgttcc tcgctaggct 
ggccgaggca cacttgcaag tgtggatccc 
cgtcatcctc ctcacgtgcg cgatccaccc 
gctcgccata ctcggtccac tcatggtgct 
cgtgcgcgca cacgggctca ttcgtgcatg 
ttatgtccaa atggctctca tgaagttggc 
tctcacccca ctgcgggact gggcccacgc 
gcccgtcgtc ttctctgata tggagaccaa 
gtgtggggac atcatcttgg gcctgcccgt 
gggaccggca gacagccttg aagggcaggg 
ctcccaacag acgcgaggcc tacttggctg 
gaaccaggtc gagggggagg tccaagtggt 
ctgcgtcaat ggcgtgtgtt ggactgtcta 
cccaaagggc ccaatcaccc aaatgtacac 
agcgcccccc ggggcgcgtt ccttgacacc 
ggtcacgagg catgccgatg tcattccggt 
actctccccc aggcccgtct cctacttgaa 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac cagaccacaa cggtttccct 360 
ccccccccct aacgttactg gccgaagccg 420 
tatgttattt tccaccatat tgccgtcttt 480 
tgtcttcttg acgagcattc ctaggggtct 540 
gttgaatgtc gtgaaggaag cagttcctct 600 
agcgaccctt tgcaggcagc ggaacccccc 660 
gccacgtgta taagatacac ctgcaaaggc 720 
gatagttgtg gaaagagtca aatggctctc 780 
tgcccagaag gtaccccatt gtatgggatc 840 
atgtgtttag tcgaggttaa aaaacgtcta 900 
ctttgaaaaa cacgataata ccatggaccg 960 
tttcgtaggt ctgatactct tgaccttgtc 1020 
catatggtgg ttacaatatt ttatcaccag 1080 
ccccctcaac gttcgggggg gccgcgatgc 1140 
agagctaatc tttaccatca ccaaaatctt 1200 
ccaggctggt ataaccaaag tgccgtactt 1260 
catgctggtg cggaaggttg ctgggggtca 1320 
cgcactgaca ggtacgtacg tttatgacca 1380 
gggcctacga gaccttgcgg tggcagttga 1440 
ggttatcacc tggggggcag acaccgcggc 1500 
ctccgcccgc agggggaggg agatacatct 1560 
gtggcgactc ctcgcgccta ttacggccta 1620 
catcatcact agcctcacag gccgggacag 1680 
ctccaccgca acacaatctt tcctggcgac 1740 
tcatggtgcc ggctcaaaga cccttgccgg 1800 
caatgtggac caggacctcg tcggctggca 1860 
atgcacctgc ggcagctcgg acctttactt 1920 
gcgccggcgg ggcgacagca gggggagcct 1980 
gggctcttcg ggcggtccac tgctctgccc 20.40 
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ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc acccgagggg ttgcgaaggc 2100 
ggtggacttt gtacccgtcg agtctatgga aaccactatg cggtccccgg tcttcacgga 2160 
caactcgtcc cctccggccg taccgcagac attccaggtg gcccatctac acgcccctac 2220 
tggtagcggc aagagcacta aggtgccggc tgcgtatgca gcccaagggt ataaggtgct 2280 
tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg gcgtatatgt ctaaggcaca 2340 
tggtatcgac cctaacatca gaaccggggt aaggaccatc accacgggtg cccccatcac 2400 
gtactccacc tatggcaagt ttcttgccga cggtggttgc tctgggggcg cctatgacat 2460 
cataatatgt gatgagtgcc actcaactga ctcgaccact atcctgggca tcggcacagt 2520 
cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg ctcgccaccg ctacgcctcc 2580 
gggatcggtc accgtgccac atccaaacat cgaggaggtg gctctgtcca gcactggaga 2640 
aatccccttt tatggcaaag ccatccccat cgagaccatc aaggggggga ggcacctcat 2700 
tttctgccat tccaagaaga aatgtgatga gctcgccgcg aagctgtccg gcctcggact 27 60 
caatgctgta gcatattacc ggggccttga tgtatccgtc ataccaacta gcggagacgt 2820 
cattgtcgta gcaacggacg ctctaatgac gggctttacc ggcgatttcg actcagtgat 2880 
cgactgcaat acatgtgtca cccagacagt cgacttcagc ctggacccga ccttcaccat 2940 
tgagacgacg accgtgccac aagacgcggt gtcacgctcg cagcggcgag gcaggactgg 3000 
taggggcagg atgggcattt acaggtttgt gactccagga gaacggccct cgggcatgtt 3060 
cgattcctcg gttctgtgcg agtgctatga cgcgggctgt gcttggtacg agctcacgcc 3120 
cgccgagacc tcagttaggt tgcgggctta cctaaacaca ccagggttgc ccgtctgcca 3180 
ggaccatctg gagttctggg agagcgtctt tacaggcctc acccacatag acgcccattt 3240 
cttgtcccag actaagcagg caggagacaa cttcccctac ctggtagcat accaggctac 3300 
ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac caaatgtgga agtgtctcat 3360 
acggctaaag cctacgctgc acgggccaac gcccctgctg tataggctgg gagccgttca 3420 
aaacgaggtt actaccacac accccataac caaatacatc atggcatgca tgtcggctga 3480 
cctggaggtc gtcacgagca cctgggtgct ggtaggcgga gtcctagcag ctctggccgc 3540 
gtattgcctg acaacaggca gcgtggtcat tgtgggcagg atcatcttgt ccggaaagcc 3600 
ggccatcatt cccgacaggg aagtccttta ccgggagttc gatgagatgg aagagtgcgc 3660 
ctcacacctc ccttacatcg aacagggaat gcagctcgcc gaacaattca aacagaaggc 3720 
aatcgggttg ctgcaaacag ccaccaagca agcggaggct gctgctcccg tggtggaatc 3780 
caagtggcgg accctcgaag ccttctgggc gaagcatatg tggaatttca tcagcgggat 3840 
acaatattta gcaggcttgt ccactctgcc tggcaacccc gcgatagcat cactgatggc 3900 
attcacagcc tctatcacca gcccgctcac cacccaacat accctcctgt ttaacatcct 3960 
ggggggatgg gtggccgccc aacttgctcc tcccagcgct gcttctgctt tcgtaggcgc 4020 
cggcatcgct ggagcggctg ttggcagcat aggccttggg aaggtgcttg tggatatttt 4080 
ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc tttaaggtca tgagcggcga 4140 
gatgccctcc accgaggacc tggttaacct actccctgct atcctctccc ctggcgccct 4200 
agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac gtgggcccag gggagggggc 4260 
tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg ggtaaccacg tctcccccac 4320 
gcactatgtg cctgagagcg acgctgcagc acgtgtcact cagatcctct ctagtcttac 4380 
catcactcag ctgctgaaga ggcttcacca gtggatcaac gaggactgct ccacgccatg 4440 
ctccggctcg tggctaagag atgtttggga ttggatatgc acggtgttga ctgatttcaa 4500 
gacctggctc cagtccaagc tcctgccgcg attgccggga gtccccttct tctcatgtca 4560 
acgtgggtac aagggagtct ggcggggcga cggcatcatg caaaccacct gcccatgtgg 4620 
agcacagatc accggacatg tgaaaaacgg ttccatgagg atcgtggggc ctaggacctg 4680 
tagtaacacg tggcatggaa cattccccat taacgcgtac accacgggcc cctgcacgcc 4740 
ctccccggcg ccaaattatt ctagggcgct gtggcgggtg gctgctgagg agtacgtgga 4800 
ggttacgcgg gtgggggatt tccactacgt gacgggcatg accactgaca acgtaaagtg 4860 
cccgtgtcag gttccggccc ccgaattctt cacagaagtg gatggggtgc ggttgcacag 4920 
gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc acattcctgg tcgggctcaa 4980 
tcaatacctg gttgggtcac agctcccatg cgagcccgaa ccggacgtag cagtgctcac 5040 
ttccatgctc accgacccct cccacattac ggcggagacg gctaagcgta ggctggccag 5100 
gggatctccc ccctccttgg ccagctcatc agctatccag ctgtctgcgc cttccttgaa 5160 
ggcaacatgc actacccgtc atgactcccc ggacgctgac ctcatcgagg ccaacctcct 5220 
gtggcggcag gagatgggcg ggaacatcac ccgcgtggag tcagaaaata aggtagtaat 5280 
tttggactct ttcgagccgc tccaagcgga ggaggatgag agggaagtat ccgttccggc 5340 
ggagatcctg cggaggtcca ggaaattccc tcgagcgatg cccatatggg cacgcccgga 5400 
ttacaaccct ccactgttag agtcctggaa ggacccggac tacgtccctc cagtggtaca 5460 
cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca cctccacgga ggaagaggac 5520 
ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg gagctcgcca caaagacctt 5580 
cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca acggcctctc ctgaccagcc 5640 
ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac tcctccatgc ccccccttga 5700 
gggggagccg ggggatcccg atctcagcga cgggtcttgg tctaccgtaa gcgaggaggc 5760 
tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg acaggcgccc tgatcacgcc 5820 
atgcgctgcg gaggaaacca agctgcccat caatgcactg agcaactctt tgctccgtca 5880 
ccacaacttg gtctatgcta caacatctcg cagcgcaagc ctgcggcaga agaaggtcac 5940 
ctttgacaga ctgcaggtcc tggacgacca ctaccgggac gtgctcaagg agatgaaggc 6000 
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gaaggcgtcc acagttaagg ctaaacttct atccgtggag gaagcctgta agctgacgcc 6060 
cccacattcg gccagatcta aatttggcta tggggcaaag gacgtccgga acctatccag 6120 
caaggccgtt aaccacatcc gctccgtgtg gaaggacttg ctggaagaca ctgagacacc 6180 
aattgacacc accatcatgg caaaaaatga ggttttctgc gtccaaccag agaagggggg 6240 
ccgcaagcca gctcgcctta tcgtattccc agatttgggg gttcgtgtgt gcgagaaaat 6300 
ggccctttac gatgtggtct ccaccctccc tcaggccgtg atgggctctt catacggatt 6360 
ccaatactct cctggacagc gggtcgagtt cctggtgaat gcctggaaag cgaagaaatg 6420 
ccctatgggc ttcgcatatg acacccgctg ttttgactca acggtcactg agaatgacat 6480 
ccgtgttgag gagtcaatct accaatgttg tgacttggcc cccgaagcca gacaggccat 6540 
aaggtcgctc acagagcggc tttacatcgg gggccccctg actaattcta aagggcagaa 6600 
ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg accagctgcg gtaataccct 6660 
cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg aagctccagg actgcacgat 6720 
gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc gcggggaccc aagaggacga 6780 
ggcgagccta cgggccttca cggaggctat gactagatac tctgcccccc ctggggaccc 6840 
gcccaaacca gaatacgact tggagttgat aacatcatgc tcctccaatg tgtcagtcgc 6900 
gcacgatgca tctggcaaaa gggtgtacta tctcacccgt gaccccacca ccccccttgc 6960 
gcgggctgcg tgggagacag ctagacacac tccagtcaat tcctggctag gcaacatcat 7020 
catgtatgcg cccaccttgt gggcaaggat gatcctgatg actcatttct tctccatcct 7080 
tctagctcag gaacaacttg aaaaagccct agattgtcag atctacgggg cctgttactc 7140 
cattgagcca cttgacctac ctcagatcat tcaacgactc catggcctta gcgcattttc 7200 
actccatagt tactctccag gtgagatcaa tagggtggct tcatgcctca ggaaacttgg 7260 
ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt gtccgcgcta ggctactgtc 7320 
ccaggggggg agggctgcca cttgtggcaa gtacctcttc aactgggcag taaggaccaa 7380 
gctcaaactc actccaatcc cggctgcgtc ccagttggat ttatccagct ggttcgttgc 7440 
tggttacagc gggggagaca tatatcacag cctgtctcgt gcccgacccc gctggttcat 7500 
gtggtgccta ctcctacttt ctgtaggggt aggcatctat ctactcccca accgatgaac 7560 
ggggacctaa acactccagg ccaataggcc atcctgtttt tttccctttt tttttttctt 7620 
tttttttttt tttttttttt tttttttttt ttctcctttt tttttcctct ttttttcctt 7680 
ttctttcctt tggtggctcc atcttagccc tagtcacggc tagctgtgaa aggtccgtga 7740 
gccgcttgac tgcagagagt gctgatactg gcctctctgc agatcaagt 7789 

<210> 23 
<211> 11062 
<212> DNA 

<213> Hepatitis C virus 
<400> 23 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgac'cggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
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aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataataat 1800 
gagcacgaat cctaaacctc aaagaaaaac caaacgtaac accaaccgcc gcccacagga 1860 
cgtcaagttc ccgggcggtg gtcagatcgt cggtggagtt tacctgttgc cgcgcagggg 1920 
ccccaggttg ggtgtgcgcg cgactaggaa gacttccgag cggtcgcaac ctcgtggaag 1980 
gcgacaacct atccccaagg ctcgccagcc cgagggtagg gcctgggctc agcccgggta 2040 
cccctggccc ctctatggca atgagggctt ggggtgggca ggatggctcc tgtcaccccg 2100 
tggctctcgg cctagttggg gccccacgga cccccggcgt aggtcgcgca atttgggtaa 2160 
ggtcatcgat accctcacgt gcggcttcgc cgatctcatg gggtacattc cgctcgtcgg 2220 
cgccccccta gggggcgctg ccagggccct ggcgcatggc gtccgggttc tggaggacgg 2280 
cgtgaactat gcaacaggga atctgcccgg ttgctccttt tctatcttcc ttttggcttt 2340 
gctgtcctgt ttgaccatcc cagcttccgc ttatgaagtg cgcaacgtat ccggagtgta 2400 
ccatgtcacg aacgactgct ccaacgcaag cattgtgtat gaggcagcgg acatgatcat 2460 
gcataccccc gggtgcgtgc cctgcgttcg ggagaacaac tcctcccgct gctgggtagc 2520 
gctcactccc acgctcgcgg ccaggaacgc tagcgtcccc actacgacga tacgacgcca 2580 
tgtcgatttg ctcgttgggg cggctgctct ctgctccgct atgtacgtgg gagatctctg 2640 
cggatctgtt ttcctcgtcg cccagctgtt caccttctcg cctcgccggc acgagacagt 2700 
acaggactgc aattgctcaa tatatcccgg ccacgtgaca ggtcaccgta tggcttggga 2760 
tatgatgatg aactggtcac ctacagcagc cctagtggta tcgcagttac tccggatccc 2820 
acaagctgtc gtggatatgg tggcgggggc ccattgggga gtcctagcgg gccttgccta 2880 
ctattccatg gtggggaact gggctaaggt tctgattgtg atgctactct ttgccggcgt 2940 
tgacggggga acctatgtga caggggggac gatggccaaa aacaccctcg ggattacgtc 3000 
cctcttttca cccgggtcat cccagaaaat ccagcttgta aacaccaacg gcagctggca 3060 
catcaacagg actgccctga actgcaatga ctccctcaac actgggttcc ttgctgcgct 3120 
gttctacgtg cacaagttca actcatctgg atgcccagag cgcatggcca gctgcagccc 3180 
catcgacgcg ttcgctcagg ggtgggggcc catcacttac aatgagtcac acagctcgga 3240 
ccagaggcct tattgttggc actacgcacc ccggccgtgc ggtatcgtac ccgcggcgca 3300 
ggtgtgtggt ccagtgtact gcttcacccc aagccctgtc gtggtgggga cgaccgaccg 3360 
gttcggcgtc cctacgtaca gttgggggga gaatgagacg gacgtgctgc ttcttaaoaa 3420 
cacgcggccg ccgcaaggca actggtttgg ctgtacatgg atgaatagca ctgggttcac 3480 
caagacgtgc gggggccccc cgtgtaacat cggggggatc ggcaataaaa ccttgacctg 3540 
ccccacggac tgcttccgga agcaccccga ggccacttac accaagtgtg gttcggggcc 3600 
ttggttgaca cccagatgct tggtccacta cccatacagg ctttggcact acccctgcac 3660 
tgtcaacttt accatcttca aggttaggat gtacgtgggg ggagtggagc acaggctcga 3720 
agccgcatgc aattggactc gaggagagcg ttgtaacctg gaggacaggg acagatcaga 3780 
gcttagcccg ctgctgctgt ctacaacgga gtggcaggta ttgccctgtt ccttcaccac 3840 
cctaccggct ctgtccactg gtttgatcca tctccatcag aacgtcgtgg acgtacaata 3900 
cctgtacggt atagggtcgg cggttgtctc ctttgcaatc aaatgggagt atgtcctgtt 3960 
gctcttcctt cttctggcgg acgcgcgcgt ctgtgcctgc ttgtggatga tgctgctgat 4020 
agctcaagct gaggccgccc tagagaacct ggtggtcctc aacgcggcat ccgtggccgg 4080 
ggcgcatggc attctctcct tcctcgtgtt cttctgtgct gcctggtaca tcaagggcag 4140 
gctggtccct ggggcggcat atgccctcta cggcgtatgg ccgctactcc tgctcctgct 4200 
ggcgttacca ccacgagcat acgccatgga ccgggagatg gcagcatcgt gcggaggcgc 4260 
ggttttcgta ggtctgatac tcttgacctt gtcaccgcac tataagctgt tcctcgctag 4320 
gctcatatgg tggttacaat attttatcac cagggccgag gcacacttgc aagtgtggat 4380 
cccccccctc aacgttcggg ggggccgcga tgccgtcatc ctcctcacgt gcgcgatcca 4440 
cccagagcta atctttacca tcaccaaaat cttgctcgcc atactcggtc cactcatggt 4500 
gctccaggct ggtataacca aagtgccgta cttcgtgcgc gcacacgggc tcattcgtgc 4560 
atgcatgctg gtgcggaagg ttgctggggg tcattatgtc caaatggctc tcatgaagtt 4620 
ggccgcactg acaggtacgt acgtttatga ccatctcacc ccactgcggg actgggccca 4680 
cgcgggccta cgagaccttg cggtggcagt tgagcccgtc gtcttctctg atatggagac 4740 
caaggttatc acctgggggg cagacaccgc ggcgtgtggg gacatcatct tgggcctgcc 4800 
cgtctccgcc cgcaggggga gggagataca tctgggaccg gcagacagcc ttgaagggca 4860 
ggggtggcga ctcctcgcgc ctattacggc ctactcccaa cagacgcgag gcctacttgg 4920 
ctgcatcatc actagcctca caggccggga caggaaccag gtcgaggggg aggtccaagt 4980 
ggtctccacc gcaacacaat ctttcctggc gacctgcgtc aatggcgtgt gttggactgt 5040 
ctatcatggt gccggctcaa agacccttgc cggcccaaag ggcccaatca cccaaatgta 5100 
caccaatgtg gaccaggacc tcgtcggctg gcaagcgccc cccggggcgc gttccttgac 5160 
accatgcacc tgcggcagct cggaccttta cttggtcacg aggcatgccg atgtcattcc 5220 
ggtgcgccgg cggggcgaca gcagggggag cctactctcc cccaggcccg tctcctactt 5280 
gaagggctct tcgggcggtc cactgctctg cccctcgggg cacgctgtgg gcatctttcg 5340 
ggctgccgtg tgcacccgag gggttgcgaa ggcggtggac tttgtacccg tcgagtctat 5400 
ggaaaccact atgcggtccc cggtcttcac ggacaactcg tcccctccgg ccgtaccgca 5460 
gacattccag gtggcccatc tacacgcccc tactggtagc ggcaagagca ctaaggtgcc 5520 
ggctgcgtat gcagcccaag ggtataaggt gcttgtcctg aacccgtccg tcgccgccac 5580 
cctaggtttc ggggcgtata tgtctaaggc acatggtatc gaccctaaca tcagaaccgg 5640 
ggtaaggacc atcaccacgg gtgcccccat cacgtactcc acctatggca agtttcttgc 5700 
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cgacggtggt tgctctgggg gcgcctatga 
tgactcgacc actatcctgg gcatcggcac 
gcgactcgtc gtgctcgcca ccgctacgcc 
catcgaggag gtggctctgt ccagcactgg 
catcgagacc atcaaggggg ggaggcacct 
tgagctcgcc gcgaagctgt ccggcctcgg 
tgatgtatcc gtcatacoaa ctagcggaga 
gacgggcttt accggcgatt tcgactcagt 
agtcgacttc agcctggacc cgaccttcac 
ggtgtcacgc tcgcagcggc gaggcaggac 
tgtgactcca ggagaacggc cctcgggcat 
tgacgcgggc tgtgcttggt acgagctcac 
ttacctaaac acaccagggt tgcccgtctg 
ctttacaggc ctcacccaca tagacgccca 
caacttcccc tacctggtag cataccaggc 
tccatcgtgg gaccaaatgt ggaagtgtct 
aacgcccctg ctgtataggc tgggagccgt 
aaccaaatac atcatggcat gcatgtcggc 
gctggtaggc ggagtcctag cagctctggc 
cattgtgggc aggatcatct tgtccggaaa 
ttaccgggag ttcgatgaga tggaagagtg 
aatgcagctc gccgaacaat tcaaacagaa 
gcaagcggag gctgctgctc ccgtggtgga 
ggcgaagcat atgtggaatt tcatcagcgg 
gcctggcaac cccgcgatag catcactgat 
caccacccaa cataccctcc tgtttaacat 
tcctcccagc gctgcttctg ctttcgtagg 
cataggcctt gggaaggtgc ttgtggatat 
cgcgctcgtg gcctttaagg tcatgagcgg 
cctactccct gctatcctct cccctggcgc 
actgcgtcgg cacgtgggcc caggggaggg 
gttcgcttcg cggggtaacc acgtctcccc 
agcacgtgtc actcagatcc tctctagtct 
ccagtggatc aacgaggact gctccacgcc 
ggattggata tgcacggtgt tgactgattt 
gcgattgccg ggagtcccct tcttctcatg 
cgacggcatc atgcaaacca cctgcccatg 
cggttccatg aggatcgtgg ggcctaggac 
cattaacgcg tacaccacgg gcccctgcac 
gctgtggcgg gtggctgctg aggagtacgt 
cgtgacgggc atgaccactg acaacgtaaa 
cttcacagaa gtggatgggg tgcggttgca 
acgggaggag gtcacattcc tggtcgggct 
atgcgagccc gaaccggacg tagcagtgct 
tacggcggag acggctaagc gtaggctggc 
atcagctatc cagctgtctg cgccttcctt 
cccggacgct gacctcatcg aggccaacct 
cacccgcgtg gagtcagaaa ataaggtagt 
ggaggaggat gagagggaag tatccgttcc 
ccctcgagcg atgcccatat gggcacgccc 
gaaggacccg gactacgtcc ctccagtggt 
ccctccgata ccacctccac ggaggaagag 
ttctgccttg gcggagctcg ccacaaagac 
cagcggcacg gcaacggcct ctcctgacca 
cgttgagtcg tactcctcca tgccccccct 
ogacgggtct tggtctaccg taagcgagga 
gtcctacaca tggacaggcg ccctgatcac 
catcaatgca ctgagcaact ctttgctccg 
tcgcagcgca agcctgcggc agaagaaggt 
ccactaccgg gacgtgctca aggagatgaa 
tctatccgtg gaggaagcct gtaagctgac 
ctatggggca aaggacgtcc ggaacctatc 
gtggaaggac ttgctggaag acactgagac 
tgaggttttc tgcgtccaac cagagaaggg 
cccagatttg ggggttcgtg tgtgcgagaa 
ccctcaggcc gtgatgggct cttcatacgg 



catcataata tgtgatgagt gccactcaac 5760 
agtcctggac caagcggaga cggctggagc 5820 
tccgggatcg gtcaccgtgc cacatccaaa 5880 
agaaatcccc ttttatggca aagccatccc 5940 
cattttctgc cattccaaga agaaatgtga 6000 
actcaatgct gtagcatatt accggggcct 6060 
cgtcattgtc gtagcaacgg acgctctaat 6120 
gatcgactgc aatacatgtg tcacccagac 6180 
cattgagacg acgaccgtgc cacaagacgc 6240 
tggtaggggc aggatgggca tttacaggtt 6300 
gttcgattcc tcggttctgt gcgagtgcta 6360 
gcccgccgag acctcagtta ggttgcgggc 6420 
ccaggaccat ctggagttct gggagagcgt 6480 
tttcttgtcc cagactaagc aggcaggaga 6540 
tacggtgtgc gccagggctc aggctccacc 6600 
catacggcta aagcctacgc tgcacgggcc 6660 
tcaaaacgag gttactacca cacaccccat 6720 
tgacctggag gtcgtcacga gcacctgggt 6780 
cgcgtattgc ctgacaacag gcagcgtggt 6840 
gccggccatc attcccgaca gggaagtcct 6900 
cgcctcacac ctcccttaca tcgaacaggg 6960 
ggcaatcggg ttgctgcaaa cagccaccaa 7020 
atccaagtgg cggaccctcg aagccttctg 7080 
gatacaatat ttagcaggct tgtccactct 7140 
ggcattcaca gcctctatca ccagcccgct 7200 
cctgggggga tgggtggccg cccaacttgc 7260 
cgccggcatc gctggagcgg ctgttggcag 7320 
tttggcaggt tatggagcag gggtggcagg 7380 
cgagatgccc tccaccgagg acctggttaa 7440 
cctagtcgtc ggggtcgtgt gcgcagcgat 7500 
ggctgtgcag tggatgaacc ggctgatagc 7560 
cacgcactat gtgcctgaga gcgacgctgc 7620 
taccatcact cagctgctga agaggcttca 7680 
atgctccggc tcgtggctaa gagatgtttg 7740 
caagacctgg ctccagtcca agctcctgcc 7800 
tcaacgtggg tacaagggag tctggcgggg 7860 
tggagcacag atcaccggac atgtgaaaaa 7920 
ctgtagtaac acgtggcatg gaacattccc 7980 
gccctccccg gcgccaaatt attctagggc 8040 
ggaggttacg cgggtggggg atttccacta 8100 
gtgcccgtgt caggttccgg cccccgaatt 8160 
caggtacgct ccagcgtgca aacccctcct 8220 
caatcaatac ctggttgggt cacagctccc 8280 
cacttccatg ctcaccgacc cctcccacat 8340 
caggggatct cccccctcct tggccagctc 8400 
gaaggcaaca tgcactaccc gtcatgactc 8460 
cctgtggcgg caggagatgg gcgggaacat 8520 
aattttggac tctttcgagc cgctccaagc 8580 
ggcggagatc ctgcggaggt ccaggaaatt 8640 
ggattacaac cctccactgt tagagtcctg 8700 
acacgggtgt ccattgccgc ctgccaaggc 8760 
gacggttgtc ctgtcagaat ctaccgtgtc 8820 
cttcggcagc tccgaatcgt cggccgtcga 8880 
gccctccgac gacggcgacg cgggatccga 8940 
tgagggggag ccgggggatc ccgatctcag 9000 
ggctagtgag gacgtcgtct gctgctcgat 9060 
gccatgcgct gcggaggaaa ccaagctgcc 9120 
tcaccacaac ttggtctatg ctacaacatc 9180 
cacctttgac agactgcagg tcctggacga 9240 
ggcgaaggcg tccacagtta aggctaaact 9300 
gcccccacat tcggccagat ctaaatttgg 9360 
cagcaaggcc gttaaccaca tccgctccgt 9420 
accaattgac accaccatca tggcaaaaaa 9480 
gggccgcaag ccagctcgcc ttatcgtatt 9540 
aatggccctt tacgatgtgg tctccaccct 9600 
attccaatac tctcctggac agcgggtcga 9660 
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gttcctggtg aatgcctgga aagcgaagaa 

ctgttttgac tcaacggtca ctgagaatga 

ttgtgacttg gcccccgaag ccagacaggc 

cgggggcccc ctgactaatt ctaaagggca 

cggtgtactg acgaccagct gcggtaatac 

ctgtcgagct gcgaagctcc aggactgcac 

tatctgtgaa agcgcgggga cccaagagga 

tatgactaga tactctgccc cccctgggga 

gataacatca tgctcctcca atgtgtcagt 

ctatctcacc cgtgacccca ccacccccct 

cactccagtc aattcctggc taggcaacat 

gatgatcctg atgactcatt tcttctccat 

cctagattgt cagatctacg gggcctgtta 

cattcaacga ctccatggcc ttagcgcatt 

caatagggtg gcttcatgcc tcaggaaact 

tcgggccaga agtgtccgcg ctaggctact 

caagtacctc ttcaactggg cagtaaggac 

gtcccagttg gatttatcca gctggttcgt 

cagcctgtct cgtgcccgac cccgctggtt 

ggtaggcatc tatctactcc ccaaccgatg 

gccatcctgt ttttttccct tttttttttt 

tttttctcct ttttttttcc tctttttttc 

ccctagtcac ggctagctgt gaaaggtccg 

ctggcctctc tgcagatcaa gt 



atgccctatg ggcttcgcat atgacacccg 9720 
catccgtgtt gaggagtcaa tctaccaatg 9780 
cataaggtcg ctcacagagc ggctttacat 9840 
gaactgcggc tatcgccggt gccgcgcgag 9900 
cctcacatgt tacttgaagg ccgctgcggc 9960 
gatgctcgta tgcggagacg accttgtcgt 10020 
cgaggcgagc ctacgggcct tcacggaggc 10080 
cccgcccaaa ccagaatacg acttggagtt 10140 
cgcgcacgat gcatctggca aaagggtgta 10200 
tgcgcgggct gcgtgggaga cagctagaca 10260 
catcatgtat gcgcccacct tgtgggcaag 10320 
ccttctagct caggaacaac ttgaaaaagc 10380 
ctccattgag ccacttgacc tacctcagat 10440 
ttcactccat agttactctc caggtgagat 10500 
tggggtaccg cccttgcgag tctggagaca 10560 
gtcccagggg gggagggctg ccacttgtgg 10620 
caagctcaaa ctcactccaa tcccggctgc 10680 
tgctggttac agcgggggag acatatatca 10740 
catgtggtgc ctactcctac tttctgtagg 10800 
aacggggacc taaacactcc aggccaatag 10860 
cttttttttt tttttttttt tttttttttt 10920 
cttttctttc ctttggtggc tccatcttag 10980 
tgagccgctt gactgcagag agtgctgata 11040 
11062 



<210> 24 
<211> 9605 
<212> DNA 

<213> Hepatitis C virus 
<400> 24 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaacgt aacaccaacc 
gtggtcagat cgtcggtgga gtttacctgt 
gcgcgactag gaagacttcc gagcggtcgc 
aggctcgcca gcccgagggt agggcctggg 
gcaatgaggg cttggggtgg gcaggatggc 
ggggccccac ggacccccgg cgtaggtcgc 
cgtgcggctt cgccgatctc atggggtaca 
ctgccagggc cctggcgcat ggcgtccggg 
ggaatctgcc cggttgctcc ttttctatct 
tcccagcttc cgcttatgaa gtgcgcaacg 
gctccaacgc aagcattgtg tatgaggcag 
tgccctgcgt tcgggagaac aactcctccc 
cggccaggaa cgctagcgtc cccactacga 
gggcggctgc tctctgctcc gctatgtacg 
tcgcccagct gttcaccttc tcgcctcgcc 
caatatatcc cggccacgtg acaggtcacc 
cacctacagc agccctagtg gtatcgcagt 
tggtggcggg ggcccattgg ggagtcctag 
actgggctaa ggttctgatt gtgatgctac 
tgacaggggg gacgatggcc aaaaacaccc 
catcccagaa aatccagctt gtaaacacca 
tgaactgcaa tgactccctc aacactgggt 
tcaactcatc tggatgccca gagcgcatgg 
aggggtgggg gcccatcact tacaatgagt 
ggcactacgc accccggccg tgcggtatcg 
actgcttcac cccaagccct gtcgtggtgg 
acagttgggg ggagaatgag acggacgtgc 
gcaactggtt tggctgtaca tggatgaata 
ccccgtgtaa catcgggggg atcggcaata 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
gccgcccaca ggacgtcaag ttcccgggcg 420 
tgccgcgcag gggccccagg ttgggtgtgc 480 
aacctcgtgg aaggcgacaa cctatcccca 540 
ctcagcccgg gtacccctgg cccctctatg 600 
tcctgtcacc ccgtggctct cggcctagtt 660 
gcaatttggg taaggtcatc gataccctca 720 
ttccgctcgt cggcgccccc ctagggggcg 780 
ttctggagga cggcgtgaac tatgcaacag 840 
tccttttggc tttgctgtcc tgtttgacca 900 
tatccggagt gtaccatgtc acgaacgact 960 
cggacatgat catgcatacc cccgggtgcg 1020 
gctgctgggt agcgctcact cccacgctcg 1080 
cgatacgacg ccatgtcgat ttgctcgttg 1140 
tgggagat'ct ctgcggatct gttttcctcg 1200 
ggcacgagac agtacaggac tgcaattgct 1260 
gtatggcttg ggatatgatg atgaactggt 1320 
tactccggat cccacaagct gtcgtggata 1380 
cgggccttgc ctactattcc atggtgggga 1440 
tctttgccgg cgttgacggg ggaacctatg 1500 
tcgggattac gtccctcttt tcacccgggt 1560 
acggcagctg gcacatcaac aggactgccc 1620 
tccttgctgc gctgttctac gtgcacaagt 1680 
ccagctgcag ccccatcgac gcgttcgctc 1740 
cacacagctc ggaccagagg ccttattgtt 1800 
tacccgcggc gcaggtgtgt ggtccagtgt 1860 
ggacgaccga ccggttcggc gtccctacgt 1920 
tgcttcttaa caacacgcgg ccgccgcaag 1980 
gcactgggtt caccaagacg tgcgggggcc 2040 
aaaccttgac ctgccccacg gactgcttcc 2100 
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ggaagcaccc cgaggccact tacaccaagt 
gcttggtcca ctacccatac aggctttggc 
tcaaggttag gatgtacgtg gggggagtgg 
ctcgaggaga gcgttgtaac ctggaggaca 
tgtctacaac ggagtggcag gtattgccct 
ctggtttgat ccatctccat cagaacgtcg 
cggcggttgt ctcctttgca atcaaatggg 
cggacgcgcg cgtctgtgcc tgcttgtgga 
ccctagagaa cctggtggtc ctcaacgcgg 
ccttcctcgt gttcttctgt gctgcctggt 
catatgccct ctacggcgta tggccgctac 
catacgccat ggaccgggag atggcagcat 
tactcttgac cttgtcaccg cactataagc 
aatattttat caccagggcc gaggcacact 
gggggggccg cgatgccgtc atcctcctca 
ccatcaccaa aatcttgctc gccatactcg 
ccaaagtgcc gtacttcgtg cgcgcacacg 
aggttgctgg gggtcattat gtccaaatgg 
cgtacgttta tgaccatctc accccactgc 
ttgcggtggc agttgagccc gtcgtcttct 
gggcagacac cgcggcgtgt ggggacatca 
ggagggagat acatctggga ccggcagaca 
cgcctattac ggcctactcc caacagacgc 
tcacaggccg ggacaggaac caggtcgagg 
aatctttcct ggcgacctgc gtcaatggcg 
caaagaccct tgccggccca aagggcccaa 
acctcgtcgg ctggcaagcg ccccccgggg 
gctcggacct ttacttggtc acgaggcatg 
acagcagggg gagcctactc tcccccaggc 
gtccactgct ctgcccctcg gggcacgctg 
gaggggttgc gaaggcggtg gactttgtac 
ccccggtctt cacggacaac tcgtcccctc 
atctacacgc ccctactggt agcggcaaga 
aagggtataa ggtgcttgtc ctgaacccgt 
atatgtctaa ggcacatggt atcgacccta 
cgggtgcccc catcacgtac tccacctatg 
ggggcgccta tgacatcata atatgtgatg 
tgggcatcgg cacagtcctg gaccaagcgg 
ccaccgctac gcctccggga tcggtcaccg 
tgtccagcac tggagaaatc cccttttatg 
gggggaggca cctcattttc tgccattcca 
tgtccggcct cggactcaat gctgtagcat 
caactagcgg agacgtcatt gtcgtagcaa 
atttcgactc agtgatcgac tgcaatacat 
acccgacctt caccattgag acgacgaccg 
ggcgaggcag gactggtagg ggcaggatgg 
ggccctcggg catgttcgat tcctcggttc 
ggtacgagct cacgcccgcc gagacctcag 
ggttgcccgt ctgccaggac catctggagt 
acatagacgc ccatttcttg tcccagacta 
tagcatacca ggctacggtg tgcgccaggg 
tgtggaagtg tctcatacgg ctaaagccta 
ggctgggagc cgttcaaaac gaggttacta 
catgcatgtc ggctgacctg gaggtcgtca 
tagcagctct ggccgcgtat tgcctgacaa 
tcttgtccgg aaagccggcc atcattcccg 
agatggaaga gtgcgcctca cacctccctt 
aattcaaaca gaaggcaatc gggttgctgc 
ctcccgtggt ggaatccaag tggcggaccc 
atttcatcag cgggatacaa tatttagcag 
tagcatcact gatggcattc acagcctcta 
t.cctgtttaa catcctgggg ggatgggtgg 
ctgctttcgt aggcgccggc atcgctggag 
tgcttgtgga tattttggca ggttatggag 
aggtcatgag cggcgagatg ccctccaccg 
tctcccctgg cgccctagtc gtcggggtcg 



gtggttcggg gccttggttg acacccagat 2160 
actacccctg cactgtcaac tttaccatct 2220 
agcacaggct cgaagccgca tgcaattgga 2280 
gggacagatc agagcttagc ccgctgctgc 2340 
gttccttcac caccctaccg gctctgtcca 2400 
tggacgtaca atacctgtac ggtatagggt 2460 
agtatgtcct gttgctcttc cttcttctgg 2520 
tgatgctgct gatagctcaa gctgaggccg 2580 
catccgtggc cggggcgcat ggcattctct 2640 
acatcaaggg caggctggtc cctggggcgg 2700 
tcctgctcct gctggcgtta ccaccacgag 2760 
cgtgcggagg cgcggttttc gtaggtctga 2820 
tgttcctcgc taggctcata tggtggttac 2880 
tgcaagtgtg gatccccccc ctcaacgttc 2940 
cgtgcgcgat ccacccagag ctaatcttta 3000 
gtccactcat ggtgctccag gctggtataa 3060 
ggctcattcg tgcatgcatg ctggtgcgga 3120 
ctctcatgaa gttggccgca ctgacaggta 3180 
gggactgggc ccacgcgggc ctacgagacc 3240 
ctgatatgga gaccaaggtt atcacctggg 3300 
tcttgggcct gcccgtctcc gcccgcaggg 3360 
gccttgaagg gcaggggtgg cgactcctcg 3420 
gaggcctact tggctgcatc atcactagcc 3480 
gggaggtcca agtggtctcc accgcaacac 3540 
tgtgttggac tgtctatcat ggtgccggct 3600 
tcacccaaat gtacaccaat gtggaccagg 3660 
cgcgttcctt gacaccatgc acctgcggca 3720 
ccgatgtcat tccggtgcgc cggcggggcg 3780 
ccgtctccta cttgaagggc tcttcgggcg 3840 
tgggcatctt tcgggctgcc gtgtgcaccc 3900 
ccgtcgagtc tatggaaacc actatgcggt 3960 
cggccgtacc gcagacattc caggtggccc 4020 
gcactaaggt gccggctgcg tatgcagccc 4080 
ccgtcgccgc caccctaggt ttcggggcgt 4140 
acatcagaac cggggtaagg accatcacca 4200 
gcaagtttct tgccgacggt ggttgctctg 4260 
agtgccactc aactgactcg accactatcc 4320 
agacggctgg agcgcgactc gtcgtgctcg 4380 
tgccacatcc aaacatcgag gaggtggctc 4440 
gcaaagccat ccccatcgag accatcaagg 4500 
agaagaaatg tgatgagctc gccgcgaagc 4560 
attaccgggg ccttgatgta tccgtcatac 4620 
cggacgctct aatgacgggc tttaccggcg 4 680 
gtgtcaccca gacagtcgac ttcagcctgg 4740 
tgccacaaga cgcggtgtca cgctcgcagc 4800 
gcatttacag gtttgtgact ccaggagaac 4860 
tgtgcgagtg ctatgacgcg ggctgtgctt 4920 
ttaggttgcg ggcttaccta aacacaccag 4980 
tctgggagag cgtctttaca ggcctcaccc 5040 
agcaggcagg agacaacttc ccctacctgg 5100 
ctcaggctcc acctccatcg tgggaccaaa 5160 
cgctgcacgg gccaacgccc ctgctgtata 5220 
ccacacaccc cataaccaaa tacatcatgg 5280 
cgagcacctg ggtgctggta ggcggagtcc 5340 
caggcagcgt ggtcattgtg ggcaggatca 5400 
acagggaagt cctttaccgg gagttcgatg 54 60 
acatcgaaca gggaatgcag ctcgccgaac 5520 
aaacagccac caagcaagcg gaggctgctg 5580 
tcgaagcctt ctgggcgaag catatgtgga 5640 
gcttgtccac tctgcctggc aaccccgcga 5700 
tcaccagccc gctcaccacc caacataccc 57 60 
ccgcccaact tgctcctccc agcgctgctt 5820 
cggctgttgg cagcataggc cttgggaagg 5880 
caggggtggc aggcgcgctc gtggccttta 5940 
aggacctggt taacctactc cctgctatcc 6000 
tgtgcgcagc gatactgcgt cggcacgtgg 6060 



32662.doc 



WO 01/89364 



PCT/US01/16822 



gcccagggga gggggctgtg cagtggatga 
accacgtctc ccccacgcac tatgtgcctg 
tcctctctag tcttaccatc actcagctgc 
actgctccac gccatgctcc ggctcgtggc 
tgttgactga tttcaagacc tggctccagt 
ccttcttctc atgtcaacgt gggtacaagg 
ccacctgccc atgtggagca cagatcaccg 
tggggcctag gacctgtagt aacacgtggc 
cgggcccctg cacgccctcc ccggcgccaa 
ctgaggagta cgtggaggtt acgcgggtgg 
ctgacaacgt aaagtgcocg tgtcaggttc 
gggtgcggtt gcacaggtac gctccagcgt 
tcctggtcgg gctcaatcaa tacctggttg 
acgtagcagt gctcacttcc atgctcaccg 
agcgtaggct ggccagggga tctcccccct 
ctgcgccttc cttgaaggca acatgcacta 
tcgaggccaa cctcctgtgg cggcaggaga 
aaaataaggt agtaattttg gactctttcg 
aagtatccgt tccggcggag atcctgcgga 
tatgggcacg cccggattac aaccctccac 
tccctccagt ggtacacggg tgtccattgc 
cacggaggaa gaggacggtt gtcctgtcag 
tcgccacaaa gaccttcggc agctccgaat 
cctctcctga ccagccctcc gacgacggcg 
ccatgccccc ccttgagggg gagccggggg 
ccgtaagcga ggaggctagt gaggacgtcg 
gcgccctgat cacgccatgc gctgcggagg 
actctttgct ccgtcaccac aacttggtct 
ggcagaagaa ggtcaccttt gacagactgc 
tcaaggagat gaaggcgaag gcgtccacag 
cctgtaagct gacgccccca cattcggcca 
tccggaacct atccagcaag gccgttaacc 
aagacactga gacaccaatt gacaccacca 
aaccagagaa ggggggccgc aagccagctc 
gtgtgtgcga gaaaatggcc ctttacgatg 
gctcttcata cggattccaa tactctcctg 
ggaaagcgaa gaaatgccct atgggcttcg 
tcactgagaa tgacatccgt gttgaggagt 
aagccagaca ggccataagg tcgctcacag 
attctaaagg gcagaactgc ggctatcgcc 
gctgcggtaa taccctcaca tgttacttga 
tccaggactg cacgatgctc gtatgcggag 
ggacccaaga ggacgaggcg agcctacggg 
ccccccctgg ggacccgccc aaaccagaat 
ccaatgtgtc agtcgcgcac gatgcatctg 
ccaccacccc ccttgcgcgg gctgcgtggg 
ggctaggcaa catcatcatg tatgcgccca 
atttcttctc catccttcta gctcaggaac 
acggggcctg ttactccatt gagccacttg 
gccttagcgc attttcactc catagttact 
goctcaggaa acttggggta ccgcccttgc 
gcgctaggct actgtcccag ggggggaggg 
gggcagtaag gaccaagctc aaactcactc 
ccagctggtt cgttgctggt tacagcgggg 
gaccccgctg gttcatgtgg tgcctactcc 
tccccaaccg atgaacgggg acctaaacac 
cctttttttt tttctttttt tttttttttt 
tcctcttttt ttccttttct ttcctttggt 
tgtgaaaggt ccgtgagccg cttgactgca 
caagt 
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accggctgat agcgttcgct tcgcggggta 6120 
agagcgacgc tgcagcacgt gtcactcaga 6180 
tgaagaggct tcaccagtgg atcaacgagg 6240 
taagagatgt ttgggattgg atatgcacgg 6300 
ccaagctcct gccgcgattg ccgggagtcc 6360 
gagtctggcg gggcgacggc atcatgcaaa 6420 
gacatgtgaa aaacggttcc atgaggatcg 6480 
atggaacatt ccccattaac gcgtacacca 6540 
attattctag ggcgctgtgg cgggtggctg 6600 
gggatttcca ctacgtgacg ggcatgacca 6660 
cggcccccga attcttcaca gaagtggatg 6720 
gcaaacccct cctacgggag gaggtcacat 6780 
ggtcacagct cccatgcgag cccgaaccgg 6840 
acccctccca cattacggcg gagacggcta 6900 
ccttggccag ctcatcagct atccagctgt 6960 
cccgtcatga ctccccggac gctgacctca 7020 
tgggcgggaa catcacccgc gtggagtcag 7080 
agccgctcca agcggaggag gatgagaggg 7140 
ggtccaggaa attccctcga gcgatgccca 7200 
tgttagagtc ctggaaggac ccggactacg 7260 
cgcctgccaa ggcccctccg ataccacctc 7320 
aatctaccgt gtcttctgcc ttggcggagc 7380 
cgtcggccgt cgacagcggc acggcaacgg 7440 
acgcgggatc cgacgttgag tcgtactcct 7500 
atcccgatct cagcgacggg tcttggtcta 7560 
tctgctgctc gatgtcctac acatggacag 7620 
aaaccaagct gcccatcaat gcactgagca 7680 
atgctacaac atctcgcagc gcaagcctgc 7740 
aggtcctgga cgaccactac cgggacgtgc 7800 
ttaaggctaa acttctatcc gtggaggaag 7860 
gatctaaatt tggctatggg gcaaaggacg 7920 
acatccgctc cgtgtggaag gacttgctgg 7980 
tcatggcaaa aaatgaggtt ttctgcgtcc 8040 
gccttatcgt attcccagat ttgggggttc 8100 
tggtctccac cctccctcag gccgtgatgg 8160 
gacagcgggt cgagttcctg gtgaatgcct 8220 
catatgacac ccgctgtttt gactcaacgg 8280 
caatctacca atgttgtgac ttggcccccg 8340 
agcggcttta catcgggggc cccctgacta 8400 
ggtgccgcgc gagcggtgta ctgacgacca 8460 
aggccgctgc ggcctgtcga gctgcgaagc 8520 
acgaccttgt cgttatctgt gaaagcgcgg 8580 
ccttcacgga ggctatgact agatactctg 8640 
acgacttgga gttgataaca tcatgctcct 8700 
gcaaaagggt gtactatctc acccgtgacc 8760 
agacagctag acacactcca gtcaattcct 8820 
ccttgtgggc aaggatgatc ctgatgactc 8880 
aacttgaaaa agccctagat tgtcagatct 8940 
acctacctca gatcattcaa cgactccatg 9000 
ctccaggtga gatcaatagg gtggcttcat 9060 
gagtctggag acatcgggcc agaagtgtcc 9120 
ctgccacttg tggcaagtac ctcttcaact 9180 
caatcccggc tgcgtcccag ttggatttat 9240 
gagacatata tcacagcctg tctcgtgccc 9300 
tactttctgt aggggtaggc atctatctac 9360 
tccaggccaa taggccatcc tgtttttttc 9420 
tttttttttt ttttttttct cctttttttt 9480 
ggctccatct tagccctagt cacggctagc 9540 
gagagtgctg atactggcct ctctgcagat 9600 
9605 
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